This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 

As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



<12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
18 January 2001 (18.01.2001) 




PCT 



(10) International Publication Number 

WO 01/04344 A2 



(51) International Patent Classification 7 : C12Q 1/00 

(21) International Application Number: PCT/USOO/18828 

(22) International Filing Date: 10 July 2000 (10.07.2000) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

6GU43/757 



8 July 1999(08.07.1999) US 



(63) Related by continuation (CON) or continuation-in-part 
(C1P) to earlier application: 

US 60/143,757 (CIP) 

Filed on 8 July 1999 (08.07. 1 999) 

(71) Applicant (for all designated States except US): 
STRESSGEN BIOTECHNOLOGIES CORPORA- 
TION [CA/CA1; 350-4243 Glanford Avenue, Victoria, 
British Columbia V8Z 4B9 (CA). 



(72) Inventors; and 

(75) Inventors/Applicants (for US only): SIEGEL, Marvin 
[USrtJS]; 150 Somerset Drive, Blue Bell. PA 19422 (US). 
CHU, N., Randall [CA/CA]; 2225 Windsor Road, Victo- 
ria, British Columbia V8S 3C8 (CA). MIZZEN, Lee, A. 
[CA/CA]; 1936 Quamichan Street, Victoria, British Co- 
lumbia V8S 2C4 (CA). 

(74) Agent: FRASER, Jan is, K.; Fish & Richardson, EC, 225 
Franklin Street, Boston, MA 021 10-2804 (US). 

(81) Designated States (national): AE, AG. AL, AM. AT, AU, 
AZ, BA. BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 
DE. DK, DM, DZ, EE, ES, FI. GB, GD, GE, GH, GM, HR, 
HU, ID, 1L, IN, IS, JP, KE, KG. KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, M K, MN, MW, MX, MZ, 
NO, NZ, PL, FT, RO, RU, SD, SE, SG. SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA. ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS. MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ. BY, KG, KZ, MD. RU, TLTM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR. IE, 

[Continued on next page] 



= (54) Title: INDUCTION OF A TH1-L1KE RESPONSE IN VITRO 



A) SOURCE OF MOUSE: CHARLES RIVER LABS 



8000 



IFN-g (pg/mL) 



< 




STIMULATING ANTIGEN 
CONCENTRATION (nmol7mL) 



m I* 

Qo.46 
□ 0.15 
E2 0.05 



1 

COMPLETE Ksp65 
RPMI 



"T 
E7 



Hsp65 HspE7 (h)E7 
*E7 



STIMULATING ANTIGEN 

(57) Abstract: The invention provides compositions and methods for stimulating a Thl-like response in vitro. Compositions include 
fusion proteins and conjugates that contain at least a portion of a heat shock protein. A Thl-like response can be elicited by contacting 
in vitro a cell sample containing naive lymphocytes with a fusion protein or conjugate of the invention. The Thl-like response can 
be detected by measuring IFN-gamma produced by the cell sample. 



WO 01/04344 A2 1 lllll lililll y Ullll ill! I li W IIIH IEIII IIIU lllfl Hit IIIIIR 1111 IIII (II! 



IT, LU, MC, NL, FT, SE), OAPI patent (BF, BJ, CF, CG, 
CI. CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— Without international search report and to be republished 
upon receipt of that report. 



For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



WO 01/04344 



PCT/US00/18828 



INDUCTION OF A TH1-LIKE RESPONSE IN VITRO 

Cross Reference to Related Applications 
This application claims priority from U.S. Provisional Application 
5 No. 60/143,757, filed July 8, 1999. The content of this application is incorporated 
herein by reference in its entirety. 

Field of the Invention 
The invention relates to fusion proteins and methods of stimulating a Thl- 
1 0 like response in vitro. 

Background 

T lymphocytes can generally be divided into two classes based upon expression 
of the CD4 and CD8 antigens. The immune response mediated by CD4+ T cells is 

15 restricted by class II major histocompatibility complex (MHC) molecules. CD4+ T 
cells, also known as helper T lymphocytes, carry out their helper functions via the 
secretion of lymphokines. The immune response mediated by CD8+ T cells is 
restricted by class I MHC molecules. CD 8+ T cells, also known as cytolytic T 
lymphocytes (CTLs), carry out cell mediated cytotoxicity and also secrete some 

20 lymphokines upon activation. 

CD4+ T cells can be further divided into Thl and Th2 subsets. Thl cells 
participate in cell mediated immunity by producing lymphokines, such as interferon 
(IFN)-gamma and tumor necrosis factor (TNF)-beta, that activate cell mediated 
immunity. Th2 cells provide help for humoral immunity by secreting lymphokines that 

25 stimulate B cells, such as EL-4 and EL-5. Antigenic stimuli that activate either the Thl 
or Th2 pathway can inhibit the development of the other. For example, IFN-ganima 
produced by a stimulated Thl cell can inhibit the formation of Th2 cells, and IL-4 
produced by a stimulated Th2 cell can inhibit the formation of Thl cells. 

Certain disease conditions, such as cancer, allergy, and parasitic infections, are 

30 characterized by a predominantly Th2 response. Under certain circumstances, the 
induction of the Thl response, typified by the production of IFN-gamma, may 
ameliorate these conditions. 
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Summary of the Invention 
The invention is based on the discovery that a cell sample containing naive 
lymphocytes can be stimulated in vitro to exhibit a Thl -like response. 

Accordingly, the invention features a method of determining whether a fusion 
5 protein stimulates a Thl -like response by: (a) providing a cell sample containing naive 
lymphocytes in vitro; (b) providing a fusion protein containing (i) a heat shock protein 
(Hsp) or a fragment thereof at least eight amino acid residues in length, fused to (ii) a 
heterologous polypeptide at least eight amino acid residues in length; (c) contacting the 
cell sample with the fusion protein; and (d) determining whether the fusion protein 
1 0 stimulates a Th 1 -like response in the cell sample. 

"Naive lymphocytes" are lymphocytes that have not been exposed to the fusion 
protein (in vivo or in vitro) prior to their use in a method the invention. An "Hsp" is a 
polypeptide consisting of a sequence that is at least 40% identical to that of a protein 
whose expression is induced or enhanced in a cell exposed to stress, e.g., heat shock. A 
15 "fusion protein" is a non-naturally occurring polypeptide containing amino acid 
sequences derived from at least two different proteins. 

The Hsp used in the method can be selected from the group consisting of 
Hsp65, Hsp40, Hsp 10, Hsp60, and Hsp71. Additionally, the fusion protein can contain 
the full amino acid sequence of any of Hsp65, Hsp40, HsplO, Hsp60, or Hsp71. In 
20 some embodiments, the fusion protein contains a fragment of an Hsp, e.g., amino acids 
1-200 of Hsp65 of Mycobacterium bovis. 

The heterologous polypeptide can contain a sequence identical to at least eight 
consecutive amino acids of (i) a protein of a human pathogen, e.g., a virus, or (ii) a 
tumor associated antigen. Examples of viruses include human papilloma virus (HPV), 
25 herpes simplex virus (HSV), hepatitis B virus (HBV), hepatitis C virus (HCV), 

cytomegalovirus (CMV), Epstein-Barr virus (EBV), influenza virus, measles virus, and 
human immunodeficiency virus (HIV). The heterologous polypeptide can contain an 
HPV E6 antigen, e.g., HPV16 E6, an HPV E7 antigen, e.g., HPV16E7, or a fragment 
of any of these antigens that is at least eight amino acid residues in length. 
30 In one example, the fusion protein contains Mycobacterium bovis BCG Hsp65 

and HPV 16 E7. 

The cell sample used in the methods of the invention can contain cells derived 
from a spleen, lymph node, peripheral blood, bone marrow, thymus, lung, respiratory 
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tract, or anogenital mucosa. In preferred embodiments, the cells are splenocytes or 
lymph node cells. 

The stimulation of a Thl-like response can be determined by detecting the 
presence of a lymphokine produced by the cell sample, e.g. EFN-gamma or TNF-beta. 

5 In one embodiment, the method also includes the steps of: (e) providing a 

second cell sample containing naive lymphocytes; (f) contacting the second cell sample 
with a second fusion protein; and (g) determining whether the second fusion protein 
stimulates a Thl-like response in the second cell sample. In this example, the first 
fusion protein contains the sequence of a full-length, naturally occurring Hsp, and the 

10 second fusion protein contains at least eight amino acids but less than all of the 
sequence of a naturally occurring Hsp. 

In another aspect, the invention features a method of screening a compound by: 
(a) providing a cell sample containing naive lymphocytes in vitro; (b) providing a 
fusion protein containing (i) a Hsp or a fragment thereof at least eight amino acid 

15 residues in length, fused to (ii) a heterologous polypeptide at least eight amino acid 
residues in length; (c) contacting the cell sample with the compound and the fusion 
protein; and (d) determining whether the cell sample exhibits a Thl-like response 
following the contacting step. In this method, a decrease in the Thl-like response in 
the presence of the compound compared to in the absence of the compound indicates 

20 that the compound inhibits a Thl-like response by the cell sample. 

The invention also includes a method of screening a compound by: (a) 
providing a cell sample containing naive lymphocytes in vitro; (b) providing a fusion 
protein containing (i) a Hsp or a fragment thereof at least eight amino acid residues in 
length, fused to (ii) a heterologous polypeptide at least eight amino acid residues in 

25 length; (c) contacting the cell sample with the compound and the fusion protein; and (d) 
determining whether the cell sample exhibits a Thl-like response following the 
contacting step. In this method, an increase in the Thl-like response in the presence of 
the compound compared to in the absence of the compound indicates that the 
compound promotes a Thl -like response by the cell sample. 

30 In another aspect, the invention features a method of determining whether a 

hybrid compound stimulates a Thl-like response by: (a) providing a cell sample 
containing naive lymphocytes in vitro; (b) providing a hybrid compound that is non- 
naturally occurring and contains (i) a non-peptide compound having a molecular weight 



3 



WO 01/04344 



PCT/US00/18828 



of less than 1,500, covalently linked to (ii) a polypeptide of at least eight amino acids in 
length, wherein the hybrid compound is made by covalently linking the non-peptide 
compound to the polypeptide; (c) contacting the cell sample with the hybrid compound; 
and (d) determining whether the hybrid compound stimulates a Thl-like response in the 
5 cell sample. In one embodiment, the non-peptide compound has a molecular weight of 
at least 100. 

In another aspect, the invention features a method of determining whether a 
hybrid compound stimulates a Thl-like response by: (a) producing a hybrid compound 
by covalently linking a non-peptide compound to a polypeptide of at least eight amino 
1 0 acids in length; (b) providing a cell sample containing naive lymphocytes in vitro; (c) 
contacting the cell sample with the hybrid compound; and (d) determining whether the 
hybrid compound stimulates a Thl-like response in the cell sample. In one 
embodiment, the non-peptide compound has a molecular weight between 100 and 
1,500. 

1 5 In another aspect, the invention features a method of determining whether a 

fusion protein stimulates a Thl-like response by: (a) providing a cell sample containing 
naive lymphocytes in vitro; (b) providing a fusion protein comprising (i) a first 
polypeptide at least eight amino acids in length, fused to (ii) a second polypeptide at 
least eight amino acids in length; (c) contacting the cell sample with the fusion protein; 

20 and (d) detecting a Thl-like response exhibited by the cell sample following the 

contacting step. In one embodiment, the detected Thl-like response is greater than a 
Thl-like response exhibited by a second cell sample containing naive lymphocytes 
when the second cell sample is contacted with either the first polypeptide, the second 
polypeptide, or a mixture of the first polypeptide and the second polypeptide. In one 

25 example, the detected Thl -like response is at least two times greater than the Thl-like 
response exhibited by the second cell sample. In another example, the detected Thl- 
like response is at least five times greater than the Thl-like response exhibited by the 
second cell sample. 

In another aspect, the invention provides a fusion protein containing (i) a HsplO 
30 protein or a fragment thereof at least eight amino acid residues in length, and (ii) a 

heterologous polypeptide at least eight amino acids in length. The HsplO protein of the 
fusion protein can be a mycobacterial protein, e.g., Mycobacterium tuberculosis HsplO 
protein. The heterologous polypeptide can contain a sequence identical to at least eight 
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consecutive amino acids of a protein of a human vims, e.g., HPV. In one example, the 
heterologous polypeptide contains HPV 16 E7. 

In another aspect, the invention provides a fusion protein containing (i) a Hsp40 
protein or a fragment thereof at least eight amino acid residues in length, and (ii) a 

5 heterologous polypeptide at least eight amino acids in length. The Hsp40 protein of the 
fusion protein can be a mycobacterial protein, e.g., Mycobacterium tuberculosis Hsp40 
protein. The heterologous polypeptide can contain a sequence identical to at least eight 
consecutive amino acids of a protein of a human virus, e.g., HPV. In one example, the 
heterologous polypeptide contains HPV16 E7. 

1 0 In another aspect, the invention provides a fusion protein containing (i) a Hsp71 

protein or a fragment thereof at least eight amino acid residues in length, and (ii) a 
heterologous polypeptide at least eight amino acids in length. The Hsp71 protein of the 
fusion protein can be a mycobacterial protein, e.g., Mycobacterium tuberculosis Hsp71 
protein. The heterologous polypeptide can contain a sequence identical to at least eight 

15 consecutive amino acids of a protein of a human virus, e.g., HPV. In one example, the 
heterologous polypeptide contains HP VI 6 E7. 

In another aspect, the invention features a method of determining whether a 
compound stimulates a Thl-like response by: (a) providing a cell sample containing 
naive lymphocytes in vitro; (b) providing a compound; (c) contacting the cell sample 

20 with the compound; and (d) detecting a Thl-like response exhibited by the cell sample 
following the contacting step. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although methods and materials similar or equivalent to those 

25 described herein can be used in the practice or testing of the present invention, the 
preferred methods and materials are described below. All publications, patent 
applications, patents, and other references mentioned herein are incorporated by 
reference in their entirety. In case of conflict, the present application, including 
definitions, will control. In addition, the materials, methods, and examples are 

30 illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the 
following detailed description, and from the claims. 
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Brief Description of the Drawings 
Figures 1 A-1B show the sequence of plasmid pET65 coding for expression of 

Hsp65. 

Figure 2 shows the sequence of plasmid pETYE7 (NH) coding for expression of 

5 E7. 

Figure 3 shows the sequence of plasmid pET/H/E7 coding for expression of 

(h)E7. 

Figures 4A-4B show the sequence of plasmid pET65C/E7-lN coding for 
expression of HspE7; 
10 Figures 5A-5B show the sequence of plasmid pETMT40E7 coding for 

expression of MT40-E7. 

Figure 6 shows the sequence of plasmid pET/OVA coding for expression of 
ovalbumin (OVA). 

Figures 7A-7C show the sequence of plasmid pET65H/OVA coding for 
1 5 expression of HspOVA. 

Figure 8 shows the sequence of plasmid pGEX/K coding for expression of GST. 

Figure 9 shows the sequence of plasmid pGEX/K/E7 coding for expression of 
GST-E7. 

Figures 10A-10B show the sequence of plasmid pET/E7/5'65 coding for 
20 expression of E7-L-BCG65. 

Figure 11 shows the sequence of plasmid pET65Fl/E7 coding for expression of 
BCG65(F1)-E7. 

Figure 12 shows the sequence of plasmid pETESE7 coding for expression of 
TB10-E7. 

25 Figures 13A-13B show the sequence of plasmid pET/E7/71 coding for 

expression of E7-TB71 . 

Figures 14A-14B show the sequence of plasmid pET/E7/7r coding for 
expression of a fusion protein. 

Figures 15A-1 5B show the sequence of plasmid pET/SP65c-E7 coding for 
30 expression of SP65(2)-E7. 

Figures 16A-16B show the sequence of plasmid pETAF60E7 coding for 
expression of AF60-E7. 
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Figures 17A-17B show enhanced IFN-gamma release by splenocytes from 
C57BL/6 mice obtained from the Charles River Laboratory (Fig. 17A) and the Jackson 
Laboratory (Fig. 17B) upon exposure to HspE7. 

Figures 1 8A-18C show enhanced IFN-gamma release by splenocytes from 
5 Balb/c (Fig. 1 8 A), C57BL/6 (Fig. 18B), and C3HeB/FeJ (Fig. 18C) mice upon 
exposure to HspE7. 

Figure 19 shows enhanced IFN-gamma release by splenocytes upon exposure to 
fusion proteins containing an antigen and a stress protein but not upon exposure to a 
fusion protein containing an antigen and a protein other than a stress protein. 
10 Figures 20A-20B show enhanced IFN-gamma release by splenocytes upon 

exposure to fusion proteins containing stress proteins of different types, stress proteins 
from different organisms, or a fragment of a stress protein. 

Figure 21 shows enhanced IFN-gamma release by lymph node cells and 
splenocytes upon exposure to fusion proteins containing an antigen and a stress protein. 
15 Figures 22A-22B show a time course of tumor incidence (Fig. 22A) and tumor 

volume (Fig. 22B) in mice injected with TC-1 tumor cells followed by an injection with 
either saline, HspE7, SP65(2)-E7, or AF60-E7. 

Figures 23 A-23B show a time course of tumor incidence (Fig. 23A) and tumor 
volume (Fig. 23B) in mice injected with TC-1 tumor cells followed by an injection with 
20 either saline, HspE7, MT40-E7, E7-MT71, or TB10-E7. 

Detailed Description 
The invention relates to methods of stimulating in vitro a Thl-like response in a 
cell sample containing naive lymphocytes. These methods are useful for assessing the 
25 ability of a protein, e.g., a fusion protein containing an Hsp linked to a heterologous 
polypeptide, to function as a stimulator of a Thl-like response. Additionally, the 
method can be used to identify compounds that can regulate a Thl-like response. 
Various materials and procedures suitable for use in the methods of the invention are 
discussed below. 

30 The terms stress protein and heat shock protein (Hsp) are used synonymously 

herein. An Hsp is a polypeptide consisting of a sequence that is at least 40% identical 
to that of a protein whose expression is induced or enhanced in a cell exposed to stress. 
Turning to stress proteins generally, cells respond to a stressor (typically heat shock 
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treatment) by increasing the expression of a group of genes commonly referred to as 
stress, or heat shock, genes. Heat shock treatment involves exposure of cells or 
organisms to temperatures that are one to several degrees Celsius above the temperature 
to which the cells are adapted In coordination with the induction of such genes, the 
5 levels of corresponding stress proteins increase in stressed cells. As used herein, a 
"stress protein," also known as a "heat shock protein" or "Hsp," is a protein that is 
encoded by a stress gene, and is therefore typically produced in significantly greater 
amounts upon the contact or exposure of the stressor to the organism. A "stress gene," 
also known as "heat shock gene" is used herein as a gene that is activated or otherwise 

1 0 detectably upregulated due to the contact or exposure of an organism (containing the 
gene) to a stressor, such as heat shock, hypoxia, glucose deprivation, heavy metal salts, 
inhibitors of energy metabolism and electron transport, and protein denaturants, or to 
certain benzoquinone ansamycins. Nover, L., Heat Shock Response, CRC Press, Inc., 
Boca Raton, FL (1991). "Stress gene" also includes homologous genes within known 

1 5 stress gene families, such as certain genes within the Hsp70 and Hsp90 stress gene 

families, even though such homologous genes are not themselves induced by a stressor. 
Each of the terms stress gene and stress protein as used in the present specification may 
be inclusive of the other, unless the context indicates otherwise. 

An antigen can be any compound, peptide or protein to which an immune 

20 response is desired Antigens of particular interest are tumor-associated antigens, 
allergens of any origin, and proteins from viruses, mycoplasma, bacteria, fungi, 
protozoa and other parasites. 



Fusion Proteins 

25 The invention provides Hsp fusion proteins. As used herein, a "fusion protein" 

is a non-naturally occurring polypeptide containing at least two amino acid sequences 
which generally are from two different proteins. The amino acid sequence of the fiill 
length fusion protein is not identical to the amino acid sequence of a naturally occurring 
protein or a fragment thereof. An Hsp fusion protein contains an Hsp or a fragment 

30 thereof at least eight amino acids in length linked to a heterologous polypeptide. An 
"Hsp polypeptide" refers to a polypeptide consisting of a sequence that is at least 40% 
identical to that of a protein whose expression is induced or enhanced in a cell exposed 
to stress, e.g., heat shock. A "heterologous polypeptide" refers to a polypeptide that is 
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fused to the Hsp protein or fragment thereof. The heterologous polypeptide is 
preferably at least eight amino acids in length. In some embodiments, the heterologous 
polypeptide is at least 10, 20, 50, 100, 150, 180, 200, or 300 amino acids in length. The 
heterologous polypeptide generally is not part or all of a naturally occurring Hsp. 

5 However, the fusion protein can also be a fusion between a first Hsp and a second, 
different, Hsp, or between all or portion of an Hsp fused to all or a portion of the same 
Hsp (as long as the resultant fusion is not identical to a naturally occurring protein). 
The Hsp polypeptide can be attached to the N-terminus or C-terminus of the 
heterologous polypeptide. Preferably the fusion protein is a purified protein. 

10 The preferred Hsp fusion protein has one Hsp polypeptide linked to one 

heterologous polypeptide, but other conformations are within the invention. In one 
embodiment, the fusion protein comprises at least two copies of the heterologous 
polypeptide, e.g., HPV16 E7. In another embodiment, the fusion protein contains at 
least two copies of the Hsp polypeptide, e.g., Hsp65. Additionally, the fusion protein 

1 5 can contain at least two different heterologous polypeptides, e.g., two or more 

fragments of a single antigenic protein representing different epitopes or fragments of 
two or more different antigenic proteins derived from the same or different tumors or 
pathogens, and/or at least two different Hsp polypeptides. 

The Hsp and heterologous polypeptide can be directly fused without a linker 

20 sequence. In preferred embodiments, the C-tenninus of the Hsp can be directly fused 
to the N-terminus of the heterologous polypeptide or the C-terminus of the 
heterologous polypeptide can be directly fused to the N-terminus of the Hsp. 

Alternatively, Hsp and heterologous polypeptides can be linked to each other 
via a peptide linker sequence. Preferred linker sequences (1) should adopt a flexible 

25 extended conformation, (2) should not exhibit a propensity for developing an ordered 
secondary structure which could interact with the functional Hsp and heterologous 
polypeptide domains, and (3) should have minimal hydrophobic or charged character, 
which could promote interaction with the functional protein domains. Typical surface 
amino acids in flexible protein regions include Gly, Asn and Ser. Permutations of 

30 amino acid sequences containing Gly, Asn and Ser would be expected to satisfy the 
above criteria for a linker sequence. Other neutral or near-neutral amino acids, such as 
Thr and Ala, can also be used in the linker sequence. Any other amino acid can also be 
used in the linker. A linker sequence length of fewer than 20 amino acids can be used 
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to provide a suitable separation of fractional protein domains, although longer linker 
sequences may also be used. 

The Hsp fusion protein may be further fused to another amino acid sequence 
that facilitates the purification of the fusion protein. One useful fusion protein is a GST 
5 fusion protein in which the Hsp-heterologous polypeptide sequences are fused to the 
C-terminus or N-terminus of the GST sequence. Another useful fusion protein is a 
poly-histidine (His) fusion protein in which the Hsp-heterologous polypeptide 
sequences are fused to either the C-terminus or N-terminus of the poly-histidine 
sequence, e.g. His x 6. In another embodiment, the fusion protein contains the chitin- 

1 0 binding region of intein, thereby permitting the purification of the fusion protein by 
chitin beads (Hoang et al. (1999) Gene 1999 237:361-71). In another embodiment, the 
fusion protein contains a signal sequence from another protein. In certain host cells 
(e.g., mammalian host cells), expression and/or secretion of the Hsp fusion protein can 
be increased through use of a heterologous signal sequence. For example, the gp67 

15 secretory sequence of the baculovirus envelope protein can be used as a heterologous 
signal sequence (Current Protocols in Molecular Biology, Ausubel et al., eds., John 
Wiley & Sons, 1992). Other examples of eukaryotic signal sequences include the 
secretory sequences of melittin and human placental alkaline phosphatase (Stratagene; 
La Jolla, California). Prokaryotic signal sequences useful for increasing secretion by a 

20 prokaryotic host cell include the phoA secretory signal (Molecular Cloning, Sambrook 
et al, second edition, Cold Spring Harbor Laboratory Press, 1989) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway, New Jersey). 

Fusion proteins of the invention, e.g., a fusion protein of Hsp65 and HPV16 E7, 
can be produced by standard recombinant techniques. For example, DNA fragments 

25 coding for the different polypeptide sequences are ligated together, in any order, 
in-frame in accordance with conventional techniques. Such techniques can include 
employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, 
alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. 

30 Correct linkage of the two nucleic acids requires that the product of the linkage encode 
a chimeric protein consisting of a Hsp moiety and a heterologous polypeptide moiety. 
In another embodiment, the fusion gene can be synthesized by conventional techniques, 
including automated DNA synthesizers. Alternatively, PCR amplification of gene 
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fragments can be carried out using anchor primers which give rise to complementary 
overhangs between two consecutive gene fragments, which are subsequently annealed 
and reamplified to generate a chimeric gene sequence (see, e.g., Current Protocols in 
Molecular Biology, Ausubel et al. eds., John Wiley & Sons: 1992). 

Expression vectors encoding fusion proteins containing a heterologous 
polypeptide and either an Hsp or a protein other than an Hsp can be prepared by the 
above procedures. Examples of Hsp fusion proteins can be found in international 
patent application WO 99/07860, incorporated herein by reference, that describes 
vector construction, expression and purification of Mycobacterium bovis BCG Hsp65 - 
HPV16 E7 (HspE7) fusion protein as well as of HPV16 E7 (E7), histidine tagged 
HPV16 E7 (hE7), and M. bovis BCG Hsp65 (Hsp65). Additional examples of nucleic 
acids encoding an Hsp optionally linked to a heterologous polypeptide, e.g., an HPV 
antigen, are described in WO 89/12455, WO 94/29459, WO 98/23735, and references 
cited therein, the contents of which are herein incorporated by reference. 

A variety of heat shock proteins have been isolated, cloned, and characterized 
from a diverse array of organisms (Mizzen, Biotherapy 10:173-189, 1998). Any Hsp or 
fragment thereof may be suitable for use in the fusion polypeptides and conjugates of 
the invention. For example, Hsp70, Hsp60, Hsp20-30, and Hsp 10 are among the major 
determinants recognized by host immune responses to infection by Mycobacterium 
tuberculosis and Mycobacterium leprae. In addition, Hsp65 of Bacille Calmette Guerin 
(BCG), a strain of Mycobacterium bovis, was found to be an effective stimulatory 
agent, as described in the examples below. 

Families of stress genes and proteins for use in the present invention are well 
known in the art and include, for example, HsplOO-200, HsplOO, Hsp90, Lon, Hsp70, 
Hsp60, TF55, Hsp40, FKBPs, cyclophilins, Hsp20-30, ClpP, GrpE, HsplO, ubiquitin, 
calnexin, and protein disulfide isomerases. See, e.g., Macario, Cold Spring Harbor 
Laboratory Res. 25:59-70, 1995; Parsell et al., Rev. Genet. 27:437-496, 1993; and U.S. 
Patent No. 5,232,833. Preferred Hsps include Hsp65, Hsp40, HsplO, Hsp60, and 
Hsp71. 

The Hsp portion of the fusion protein can include either a full length Hsp or a 
fragment of an Hsp at least eight amino acids in length. In some embodiments, the Hsp 
fragment is greater than 10 amino acids in length, and preferably is at least 20, 50, 100, 
1 50, 1 80, 200, or 300 amino acids in length. In one embodiment, the Hsp portion of the 
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fusion protein consists of amino acids 1-200 of Hsp65 of Mycobacterium bovis. Other 
portions of Hsp65 and other Hsps can be used in a fusion protein to elicit a Thl-like 
response in vitro. Other preferred Hsps include Hsp40 of M. tuberculosis, HsplO of M 
tuberculosis, Hsp65 of Streptococcus pneumoniae, and Hsp60 of Aspergillus fumigatus. 
5 Heterologous polypeptides can contain any amino acid sequence useful for stimulating 
an immune response, in vitro and/or in vivo. Preferably, the heterologous polypeptide 
contains an MHC-binding epitope, e.g., an MHC class I or MHC class II binding 
epitope. The heterologous polypeptide can contain sequences found in a protein 
produced by a human pathogen, e.g., viruses, bacteria, mycoplasma, fungi, protozoa, 

10 and other parasites, or sequences found in the protein of a tumor associated antigen 
(TAA). Examples of viruses include human papilloma virus (HPV), herpes simplex 
virus (HSV), hepatitis B virus (HBV), hepatitis C virus (HCV), cytomegalovirus 
(CMV), Epstein-Barr virus (EBV), influenza virus, measles virus, and human 
immunodeficiency virus (HIV). Examples of tumor associated antigens include 

1 5 MAGE1, MAGE2, MAGE3, BAGE, GAGE, FRAME, SSX-2, Tyrosinase, MART-1, 
NY-ESO-1, gplOO, TRP-l, TRP-2, A2 melanotope, BCR/ABL, Proeinase- 
3/Myeloblastin, HER2/neu, CEA, PI A, HK2, PAPA, PSA, PSCA, PSMA, pg75, 
MUM-1, MUC-1, E6, E7, GnT-V, Beta-catenin, CDK4 and 
P15. 

20 HPV antigens from any strain of HPV are suitable for use in the fusion 

polypeptide. HPV expresses six or seven non-structural and two structural proteins. 
Viral capsid proteins LI and L2 are the late structural proteins. LI is the major capsid 
protein, the amino acid sequence of which is highly conserved among different HPV 
types. There are seven early non-structural proteins. Proteins El, E2, and E4 play an 

25 important role in virus replication. Protein E4 also plays a role in virus maturation. 
The role of E5 is less well known. Proteins E6 and E7 are oncoproteins critical for 
viral replication, as well as for host cell immortalization and transformation. Fusion 
proteins of the invention can contain either the entire sequence of an HPV protein or a 
fragment thereof, e.g., a fragment of at least 8 amino acids. In one embodiment, the 

30 HPV antigenic sequence is derived from a "high risk" HPV, such as HPV16 or HPV18 
E7 protein. The HPV antigenic sequence can include an MHC-binding epitope, e.g., an 
MHC class I and/or an MHC class II binding epitope. 
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In addition to Hsp fusion proteins, other fusion proteins can be used in the in 
vitro assay described herein. These non-Hsp fusion proteins contain a first polypeptide 
at least eight amino acids in length, fused to a second polypeptide at least eight amino 
acids in length, wherein the first and second polypeptides are derived from different 

5 proteins (preferably naturally occurring proteins). The fusion protein itself does not 
have the sequence of a naturally occurring protein. 

In the fusion protein of the invention, neither the first nor second polypeptide is 
an amino acid sequence that is commonly used for protein purification or detection, 
e.g., GST or poly-histidine. 

10 In order to produce the fusion protein, a nucleic acid encoding the fusion protein 

can be introduced into a host cell, e.g., a bacterium, a primary cell, or an immortalized 
cell line using an expression vector. The recombinant cells are then used to produce the 
fusion protein. The transfection can be transient or stable, the later sometimes 
accomplished by homologous recombination. 

1 5 The nucleotide sequence encoding a fusion protein will usually be operably 

linked to one or more regulatory sequences, selected on the basis of the host cells to be 
used for expression. The term "regulatory sequence" refers to promoters, enhancers 
and other expression control elements (e.g., polyadenylation signals). Such regulatory 
sequences are described, for example, in Goeddel (1990) Gene Expression Technology: 

20 Methods in Enzymology 185, Academic Press, San Diego, CA, the content of which is 
incorporated herein by reference. Regulatory sequences include those that direct 
constitutive expression of a nucleotide sequence in many types of host cells, those that 
direct expression of the nucleotide sequence only in certain host cells (e.g., tissue- 
specific regulatory sequences), and those that direct expression in a regulatable manner 

25 (e.g., only in the presence of an inducing agent). It will be appreciated by those skilled 
in the art that the design of the expression vector may depend on such factors as the 
choice of the host cell to be transformed, the level of expression of fusion protein 
desired, and the like. 

Recombinant expression vectors can be designed for expression of fusion 

30 proteins in prokaryotic or eukaryotic cells. For example, fusion proteins can be 
expressed in bacterial cells such as E. coli, insect cells (e.g., in the baculovirus 
expression system), yeast cells or mammalian cells. Some suitable host cells are 
discussed further in Goeddel (1990) Gene Expression Technology: Methods in 
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Enzymology 185, Academic Press, San Diego, CA. Examples of vectors for expression 
in yeast S. cerevisiae include pYepSecl (Baldari et al (1987) EMBO J. 6:229-234), 
pMFa (Kuijan and Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz et al (1987) 
Gene 54:1 13-123), and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus 

5 vectors available for expression of fusion proteins in cultured insect cells (e.g., Sf 9 
cells) include the pAc series (Smith et al (1983) Mol Cell Biol 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

Examples of mammalian expression vectors include pCDM8 (Seed (1987) 
Nature 329:840) and pMT2PC (Kaufinan et al (1987), EMBO J. 6:187-195). When 

10 intended for use in mammalian cells, the expression vector's control functions are often 
provided by viral regulatory elements. For example, commonly used promoters are 
derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40. 

In addition to the regulatory control sequences discussed above, the 
recombinant expression vector can contain additional nucleotide sequences. For 

1 5 example, the recombinant expression vector may encode a selectable marker gene to 
identify host cells that have incorporated the vector. Moreover, to facilitate secretion of 
the fusion protein from a host cell, in particular mammalian host cells, the recombinant 
expression vector can encode a signal sequence linked to the ammo-terminus of the 
fusion protein, such that upon expression, the fusion protein is synthesized with the 

20 signal sequence fused to its amino terminus. This signal sequence directs the fusion 
protein into the secretory pathway of the cell and is then usually cleaved, allowing for 
release of the mature fusion protein (i.e., the fusion protein without the signal sequence) 
from the host cell. Use of a signal sequence to facilitate secretion of proteins or 
peptides from mammalian host cells is known in the art. 

25 Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" refer to a variety of art-recognized techniques for 
introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 

30 lipofection, electroporation, microinjection and viral-mediated transfection. Suitable 
methods for transforming or transfecting host cells can be found in Sambrook et al 
(Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor 
Laboratory Press (1989)), and other laboratory manuals. 
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Often only a small fraction of mammalian cells integrate the foreign DNA into 
their genome. In order to identify and select these integrants, a gene that encodes a 
selectable marker {e.g., resistance to antibiotics) can be introduced into the host cells 
along with the gene encoding the fusion protein. Preferred selectable markers include 
those that confer resistance to drugs such as G418, hygromycin and methotrexate. 
Nucleic acid encoding a selectable marker can be introduced into a host cell on the 
same vector as that encoding the fusion protein or can be introduced on a separate 
vector. Cells stably transfected with the introduced nucleic acid can be identified by 
drug selection {e.g., cells that have incorporated the selectable marker gene will 
survive, while the other cells die). 

Alternatively, a recombinant expression vector can be transcribed and translated 
in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

In addition to the recombinant techniques described above, a fusion protein of 
the invention can be formed by linking two polypeptides, e.g., a Hsp and a heterologous 
polypeptide, to form a conjugate. Methods of forming Hsp conjugates are described in 
WO 89/12455, WO 94/29459, WO 98/23735, and WO 99/07860, the contents of which 
are herein incorporated by reference. As used herein, an Hsp "conjugate" comprises an 
Hsp that has been covalently linked to a heterologous polypeptide via the action of a 
coupling agent. A conjugate thus comprises two separate molecules that have been 
coupled one to the other. The term "coupling agent," as used herein, refers to a reagent 
capable of coupling one polypeptide to another polypeptide, e.g., a Hsp to a 
heterologous polypeptide. Any bond which is capable of linking the components such 
that the linkage is stable under physiological conditions for the time needed for the 
assay (e.g., at least 12 hours, preferably at least 72 hours) is suitable. The link between 
two components may be direct, e.g., where a Hsp is linked directly to a heterologous 
polypeptide, or indirect, e.g., where a Hsp is linked to an intermediate, e.g., a backbone, 
and that intermediate is also linked to the heterologous polypeptide. A coupling agent 
should function under conditions of temperature, pH, salt, solvent system, and other 
reactants that substantially retain the chemical stability of the Hsp, the backbone (if 
present), and the heterologous polypeptide. 

A coupling agent can link components, e.g., a Hsp and a heterologous 
polypeptide, without the addition of the coupling agent to the resulting fusion protein. 
Other coupling agents result in the addition of the coupling agent to the resulting fusion 
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protein. For example, coupling agents can be cross-linking agents that are homo- or 
hetero-bifunctional, and wherein one or more atomic components of the agent is 
retained in the composition. A coupling agent that is not a cross-linking agent can be 
removed entirely following the coupling reaction, so that the molecular product is 
5 composed entirely of the Hsp, the heterologous polypeptide, and a backbone moiety (if 
present). 

Many coupling agents react with an amine and a carboxylate, to form an amide, 
or an alcohol and a carboxylate to form an ester. Coupling agents are known in the ait, 
see, e.g., M. Bodansky, "Principles of Peptide Synthesis", 2nd ed., referenced herein, 
10 and T. Greene and P. Wuts, "Protective Groups in Organic Synthesis," 2nd Ed, 1991, 
John Wiley, NY. Coupling agents should link component moieties stably, but such that 
there is minimal or no denaturation or deactivation of the Hsp or the heterologous 
polypeptide. 

The conjugates of the invention can be prepared by coupling a Hsp to a 
1 5 heterologous polypeptide using methods known in the art. A variety of coupling 

agents, including cross-linking agents, can be used for covalent conjugation. Examples 
of cross-linking agents include N,>T-dicyclohexylcarbodiimide PCC; Pierce), N- 
succinimidyl-S-acetyl-thioacetate (SATA), N-succinimidyl-3-(2- 
pyridyldithio)propionate (SPDP), ortho-phenylenedimaleimide (o-PDM), and 
20 sulfosuccinimidyl 4-(N-maleimidomethyl) cyclohexane- 1 -carboxylate (sulfo-SMCC). 
See, e.g., Karpovsky et al (1984) J. Exp, Med. 160:1686 and Liu et al (1985) Proc. 
Natl. Acad. Set USA 52:8648. Other methods include those described by Paulus 
{\9%5) Behring Ins. Mitt. 78:118-132; Brennanef a/. (1985) Sc/ence 229:81-83; and 
Glennie et aL (1987) J. Immunol. 139: 2367-2375. A large number of coupling agents 
25 for peptides and proteins, along with buffers, solvents, and methods of use, are 
described in the Pierce Chemical Co. catalog, pages T-155 -T-200, 1994 (3747 N. 
Meridian Rd., Rockford IL, 61 105, U.S.A.; Pierce Europe B.V., P.O. Box 1512, 3260 
BA Oud Beijerland, The Netherlands), which catalog is hereby incorporated by 
reference. 

30 DCC is a useful coupling agent (Pierce #20320; Rockford, IL). It promotes 

coupling of the alcohol NHS in DMSO (Pierce #20684), forming an activated ester 
which can be cross-linked to polylysine. DCC (N^-dicyciohexylcarbodiimide) is a 
carboxy-reactive cross-linker commonly used as a coupling agent in peptide synthesis, 
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and has a molecular weight of 206.32. Another useful cross-linking agent is SPDP 
(Pierce #21 557), a heterobifunctional cross-linker for use with primary amines and 
sulfhydryl groups. SPDP has a molecular weight of 3 12.4 and a spacer arm length of 
6.8 angstroms, is reactive to NHS-esters and pyridyldithio groups, and produces 
5 cleavable cross-linking such that upon further reaction, the agent is eliminated so the 
Hsp can be linked directly to a backbone or heterologous polypeptide. Other useful 
conjugating agents are SATA (Pierce #26102) for introduction of blocked SH groups 
for two-step cross-linking, which are deblocked with hydroxylamine-HCl (Pierce 
#26103), and sulfo-SMCC (Pierce #22322), reactive towards amines and sulfhydryls. 

10 Other cross-linking and coupling agents are also available from Pierce Chemical Co. 
(Rockford, IL). Additional compounds and processes, particularly those involving a 
Schiff base as an intermediate, for conjugation of proteins to other proteins or to other 
compositions, for example to reporter groups or to chelators for metal ion labeling of a 
protein, are disclosed in EP 243,929 A2 (published Nov. 4, 1987). 

15 Polypeptides that contain carboxyl groups can be joined to lysine e-amino 

groups in the heterologous polypeptide either by preformed reactive esters (such as N- 
hydroxy succinimide ester) or esters conjugated in situ by a carbodiimide-mediated 
reaction. The same applies to Hsps containing sulfonic acid groups, which can be 
transformed to sulfonyl chlorides that react with amino groups. Hsps that have 

20 carboxyl groups can be joined to amino groups on the polypeptide by an in situ 
carbodiimide method. Hsps can also be attached to hydroxyl groups of serine or 
threonine residues, or to sulfhydryl groups of cysteine residues. 

In addition to conjugates of two polypeptides, e.g., a Hsp and a heterologous 
polypeptide, hybrid compounds can be constructed containing a non-peptide compound 

25 covalently linked to a polypeptide at least eight amino acids in length. The polypeptide 
component of this hybrid compound can be any of the heterologous polypeptides 
described herein as a component of a Hsp fusion protein or conjugate. Examples of the 
non-peptide component of this hybrid compound include polynucleotides, 
polynucleotide analogs, nucleotides, nucleotide analogs, organic or inorganic 

30 compounds having a molecular weight less than about 5,000 grams per mole, 

preferably between about 1,500 and 100 grams per mole, and salts, esters, and other 
pharmaceutical^ acceptable forms of such non-peptide compounds. 
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In Vitro Assays for Thl -Like Activity 

Cell samples containing naive lymphocytes are prepared from any mammal, 
e.g., a mouse, rat, rabbit, goat, or human, and are plated at an appropriate density in one 
or more tissue culture plates. A naive lymphocyte is a lymphocyte that has not been 

5 exposed (either in vivo or in vitro) to the fusion protein (or to either of the polypeptides 
that are joined to make the fusion protein) prior to the cell's use in the in vitro assay. 
The cell sample can be derived from any of various primary or secondary lymphoid 
organs or tissues of an animal, e.g., spleen, lymph node, peripheral blood, bone 
marrow, or thymus. The sample may also be derived from any tissue in the body 

10 containing lymphoid cells, such as the lung, respiratory tract (including pharynx, 
larynx, trachea, bronchi, etc), and anogenital mucosa. The cell sample can include 
naive lymphocytes selected from NK cells, NK T cells, <xPT cells and y5T cells. The 
cell sample can be either unfractionated or enriched for a particular cell type or cell 
types. In addition to naive lymphocytes, the cell sample can optionally include naive 

1 5 antigen presenting cells such as macrophages, dendtritic cells, and/or B cells. The cell 
sample can optionally include cell lines, e.g., a transformed T cell line or a T cell clone. 

The cell sample is exposed in vitro to a fusion protein or a conjugate described 
herein. Following a period of incubation between the cell sample and the fusion 
protein or conjugate, e.g., 6, 12, 24, 36, 48, 60, 72, or 96 hours, a determination is made 

20 as to whether a Thl-like response has been elicited in the cell sample. A Thl-like 
response can be detected, for example, by measuring the production of particular 
lymphokines, e.g., IFN-gamma or TNF-beta, by the cell sample. Alternatively, a Thl- 
like response can be detected by assaying for cell surface marker expression, such as 
SLAM (signaling lymphocytic activation molecule), or for cytokine expression, using a 

25 variety of techniques (for example, flow cytometry). 

In one example, pooled, unfractionated splenocyte cultures containing naive 
lymphocytes are prepared from a mouse and are plated in tissue culture plates. 
Methods of isolating and culturing splenocytes are described in Current Protocols in 
Immunology, Coligan et aL, eds., John Wiley & Sons, 2000. Cultures of splenocytes 

30 are then exposed to different concentrations of a test protein, e.g., a recombinant Hsp 
fusion protein, Hsp, the antigen alone, or another antigen-containing fusion protein, for 
a time that is sufficient to elicit a measurable IFN-gamma response against a standard 
antigen-stress protein fusion protein such as, for example, HspE7, described in patent 
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application WO 99/07860 and employed in the Examples below. Following exposure 
of the cell sample to the test protein, the IFN-gamma level in the extracellular medium 
is determined using a suitable assay such as an IFN-gamma ELISA. 

Results of the assays described below reveal that IFN-gamma release elicited by 
5 exposure of splenocytes or lymph node cells to an Hsp fusion protein is much more 
substantial than that induced by exposure to the antigen itself, the Hsp itself, an 
admixture of antigen and Hsp, or a fusion between antigen and a protein other than a 
Hsp. 

The assay of the invention can be used to evaluate a preparation of an Hsp 

10 fusion protein (e.g., as a quality control assay) or compare different preparations of Hsp 
fusion proteins. The measurements taken in the assay constitute a method for 
identifying a particularly active batch or to eliminate substandard batches of fusion 
protein preparations. The assay may also be used to optimize production procedures, 
storage regimes, etc. In cases in which a maximal Thl-like response to a particular 

15 antigen is desired, the assays can be used to test different fusions between the antigen 
and different types of Hsps or Hsps of different origins. Furthermore, the assay can be 
used to test a series of different candidate antigens, to identify the antigen that gives 
rise to the most pronounced Thl-like response when fused to a Hsp. 

The assay can also be used to identify regions in an antigen sequence or an Hsp 

20 sequence that are primarily responsible for eliciting a Thl-like response and thus have 
therapeutic potential. To identify such active regions in an antigen, fusions containing 
individual subregions of the antigen fused to an Hsp can be prepared and tested in the 
assay of the invention. To identify active regions in an Hsp, fusions containing 
individual subregions of the Hsp fused to the antigen can be prepared and tested. These 

25 determinations will provide the basis for the construction of shortened fusion proteins 
comprising subregions of antigen and/or Hsp that are sufficient to elicit a Thl-like 
response. Fusions containing subregions of a Hsp and/or subregions of an antigen can 
be tested by comparing the elicited Thl-like response to that induced by a full length 
fusion protein with known activity, e.g., HspE7. 

30 The fusion proteins described herein are useful in assays for screening 

compounds for their effectiveness in stimulating a Thl-like response. For example, the 
Hsp fusion proteins that were found to stimulate IFN-gamma secretion in the in vitro 
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assay can be used as controls to test candidate compounds for their ability to produce 
the same effect. 

The system described herein for stimulating a Thl-like response in vitro can be 
used to generate activated Thl cells ex vivo for reimplantation into an individual. This 
5 may be useful for treating conditions characterized by a dominant Th2 immune 
response and an insufficient Thl response. 

The assay can also be used to identify compounds that can regulate a Thl-like 
response. Compounds can be screened for their ability to inhibit an Hsp-fusion protein- 
induced Thl-like response, or to promote a Thl-like response in a manner similar to a 

1 0 Hsp fusion protein, or to enhance the Thl-like response induced by a Hsp fusion 
protein (or any other protein found to act in a manner comparable to a Hsp fusion 
protein). Inhibitory compounds may be useful to treat conditions characterized by an 
inappropriate Thl response, e.g., inflammatory and autoimmune diseases. Potential 
inhibitors (e.g., of binding of antigen-stress protein fusion proteins to antigen- 

1 5 presenting cells or of stress protein fusion-enhanced antigen processing) can be 
screened as follows. A cell sample comprising naive lymphocytes is mixed with a 
fusion protein or conjugate that is known to induce a Thl -like response, e.g., IFN- 
gamma secretion. Compounds to be screened as potential inhibitors are added to the 
cell culture either before, after, or simultaneous to the addition of the fusion protein or 

20 conjugate. The effect of the compound on the induction of a Thl-like response, e.g., as 
measured by IFN-gamma release, can be determined by comparing the response to that 
obtained when the fusion protein or conjugate alone is added to the cell sample. 

In a similar manner, compounds can be screened for their ability to promote a 
Thl-like response. Any compound can be screened for its ability to regulate a Thl-like 

25 response, including both peptides and non-peptide chemicals. These compounds 

include, but are not limited to, peptides, peptidomimetics (e.g., peptoids), amino acids, 
amino acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide 
analogs, organic or inorganic compounds having a molecular weight less than about 
5,000 grams per mole, and salts, esters, and other pharmaceutical^ acceptable forms of 

30 such compounds. In this case, a cell sample comprising naive lymphocytes is contacted 
with a test compound. The effect of the test compound on the induction of a Thl-like 
response, e.g., as measured by IFN-gamma release, is then measured and compared to a 
control (no test sample) or compared to an Hsp fusion known to stimulate a Thl-like 
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response. This assay can be used to identify novel compounds that can be used to 
stimulate a Thl-like response. Preferably the Thl-like response stimulated by the 
compound is at least 25%, e.g., at least 40%, 50%, 60%, 70%, or 80%, the level of the 
maximum response induced by an HspE7 fusion protein. In one embodiment, the 
compound is preferably not a naturally occurring compound. In another embodiment, 
the compound is a peptide, wherein the peptide does not correspond to the fragment of 
a naturally occurring protein. 

The following are examples of the practice of the invention. They are not to 
be construed as limiting the scope of the invention in any way. 

Examples 

Example 1 : Bacterial Growth and Cell Lysis for Production of Recombinant Proteins 
E. coli strains BL21(DE3) or BLR(DE3) (Novagen) were used as the host for all 
recombinant protein production, with the exception of pET65, which was transformed 
into BL21(DE3) pLysS (Novagen). BL21(DE3) pLysS cells harboring pET65 were 
grown in 2xYT media (20 g/L tryptone; 10 g/L yeast extract, 20 g/L NaCl; Milli-Q™ 
quality water) containing 30 ng/ml kanamycin and 34 ng/ml chloramphenicol, while all 
other transformants were grown in 2xYT media containing 30 fig/ml kanamycin. All 
bacterial cultures were grown in 2 L shaker flasks at 200-400 rpm to OD6oo=0.5 and 
then induced with 0.5 mM IPTG for 3 hours at 37°C. Cells were then harvested by 
centrifugation at 4°C and 4,000 - 8,000 g for 5 minutes, then suspended in 300 ml of 
Lysis Buffer (10 mM TRIS-HC1, 10 mM 2-mercaptoethanol, pH 7.5), lysozyme was 
added to 200 ng/mL, and the suspension mixed and frozen at -70°C. 

To purify the recombinant protein, the cells were thawed using a 37°C 
waterbath and proteinase inhibitors were added (2 ng/ml aprotinin, 2 ng/ml leupeptin, 2 
^Lg/ml pepstatin and 2 mM PMSF). The cell suspension was split into 50 mL samples, 
stored on ice, and sonicated 3-4 times for 30 seconds at Power-Level 5-8 (Sonicator 
450, Branson, Corp.). The supernatant was separated from the pellet by centrifugation 
at 35,000 - 60,000 g for 1 0 -20 minutes at 4°C. For soluble proteins, the supernatant 
was kept and processed as the Soluble Fraction. For proteins found in inclusion bodies, 
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the supernatant was discarded and the pellet was washed with Lysis Buffer (optionally 
containing I M urea, 1 %(v/v) Triton X-100). The resulting mixture was then 
centrifugation at 35,000 - 60,000 g for 10 -20 minutes at 4°C and the supernatant 
discarded. The pellet was dissolved in Lysis Buffer containing 8 M urea. This mixture 
was then centrifiiged at 4°C for 10- 20 minutes at 35,000 - 60,000 g and the pellet was 
discarded and the supernatant stored at -70°C as the Inclusion Body fraction. 

Example 2: Production of Recombinant M bovis BCG Hsp65 (Hsp65) 

A plasmid encoding Hsp65 was constructed as follows. The M bovis BCG 
Hsp65 coding sequence was PCR amplified from pRIB1300 (van Eden et al (1988) 
Nature 331 :1 7 1-173) using the following primers. The forward primer (w046: 5' TTC 
GCC ATG GCC AAG ACA ATT GCG 3'; SEQ ID NO:l) contains an ATG start 
codon at an Ncol site. The reverse primer (w078: 5' TTC TCG GCT AGC TCA GAA 
ATC CAT GCC 3'; SEQ ID NO:2) contains an Nhe I site downstream of a TGA stop 
codon. The PCR product was digested with Ncol and Nhel, purified and ligated to 
pET28a (Novagen) which had been cut with Ncol and Nhel. Plasmid pET65 encodes 
the M. bovis BCG Hsp65 protein, abbreviated Hsp65. The nucleotide sequence (SEQ 
ID NO:3) coding for expression of Hsp65 (SEQ ID NO:4) is shown in Figs. 1 A-1B. 

The Hsp65 protein was purified as follows. The Soluble Fraction was prepared 
as described above from E. colt BL21(DE3) pLysS cells transformed with plasmid 
pET65. The M. bovis BCG Hsp65 protein (Hsp65) present in the Soluble Fraction was 
purified by the following chromatographic steps: SP-Sepharose (200 ml column, 
Amersham Pharmacia), Q-Sepharose (200 ml column, Amersham Pharmacia), 
Sephacryl S-300 (500 ml column, Amersham Pharmacia) and ceramic hydroxyapatite 
(HAP; 100 ml column, Biorad). Purified Hsp65 was exchanged into Dulbecco's 
modified phosphate buffered saline (DPBS)/15% (v/v) glycerol and stored at -70°C. 

Example 3: Production of Recombinant HPV16 E7 (El) 

A plasmid encoding HPV16 E7 was constructed as follows. The HPV16 E7 
coding sequence was PCR-amplified from pSK/HPV16 (ATCC) using primers w280 
and wl34 (w280: CCA GCT GTA ACC ATG GAT GGA GAT (SEQ ED NO:5) and 
wl34: AGC CAT GAA TTC TTA TGG TTT CTG (SEQ ID NO:6)). The PCR product 
was digested with restriction enzyme Nco I and EcoR I and purified from an agarose 
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gel. The purified PCR product was ligated to pET28a that had been previously 
digested with the same enzymes. The ligation reaction was used to transform E. coli 
DH5alpha and putative clones containing the HPV16 E7 gene insert were selected 
based on diagnostic restriction digestion. This initial restriction analysis was confirmed 
5 by DNA sequence analysis of entire gene, promoter and termination regions. DNA of 
the confirmed construct, named pET/E7 (NH), was then introduced by electroporation 
into E. coli strain BL21(DE3). The nucleotide sequence (SEQ ID NO:7) coding for 
expression of E7 (SEQ ID NO:8) is shown in Fig. 2. 

The HPV16 E7 protein was purified as follows. The Soluble Fraction was 

1 0 prepared as described above from E. coli BL2 1 (DE3) cells transformed with plasmid 
pET/E7 (NH). The HPV 16 E7 protein was purified by the following chromatographic 
steps: Q-Sepharose (1 00 ml column, Araersham Pharmacia); Superdex 200 (26/60 
column, Amersham Pharmacia); and Ni-chelating Sepharose (100 ml, Amersham 
Pharmacia) under denaturing conditions with serial washings containing 2% (v/v) 

1 5 Triton X- 1 00 followed by serial washing to remove residual Triton X- 1 00, and the 
pooled fractions containing HPV E7 protein were then dialyzed overnight against 30 
mM TRISHC1, 1 M NaCl, 1 mM 2-mercaptoethanol, pH 7.5. The dialyzed protein 
was further purified by Ni-chelating Sepharose (75 ml, Amersham Pharmacia) under 
denaturing conditions with serial washings containing 2% (v/v)Triton X-100 followed 

20 by serial washing to remove residual Triton X-l 00. The purity of the protein was 

checked by SDS-PAGE, the appropriate fractions pooled and dialyzed overnight at 4°C 
against DPBS/10 %(v/v) glycerol. 

Example 4: Production of Recombinant Histidine-tagged HPV 1 6 E7 ((h)E7) 
25 A plasmid encoding (h)E7 was constructed as follows. The HPV 16 E7 coding 

sequence was PCR amplified from HPV16 genomic DNA (pSK/HPV16) using the 
following primers. The forward primer (wl33: 5' AAC CCA GCT GCT AGC ATG 
CAT GGA GAT 3'; SEQ ED NO:9) contains an Nhel site upstream of an ATG start 
codon. The reverse primer (wl34: 5' AGC CAT GAA TTC TTA TGG TTT CTG 3*; 
30 SEQ ID NO: 10) contains an EcoRI site downstream of a TAA stop codon. The PCR 
product was digested with Nhel and EcoRI, purified and ligated to pET28a which had 
been cut with Nhel and EcoRI. pET/H/E7 which encodes the HPV16 E7 protein 
containing an N-terminal histidine tag, abbreviated (h)E7, was used to transform E. coli 
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BL21(DE3) cells. The nucleotide sequence (SEQ ID NO:l 1) coding for expression of 
(h)E7 (SEQ ID NO:12) is shown in Fig. 3. 

The (h)E7 protein was purified as follows. The Inclusion Body fraction was 
prepared as described above from £. coli BL21(DE3) cells transformed with plasmid 
pET/H/E7. The N-terminal histidine-tagged HPV16 E7 protein ((h)E7) present in the 
Inclusion Body fraction was purified using the following chromatographic steps: Ni- 
chelating Sepharose (60 ml, Amersham Pharmacia) under denaturing conditions with 
serial washings containing 2% (v/v) Triton X-100 followed by serial washing to 
remove residual Triton X-100. Bound (h)E7 was refolded on the resin and eluted by a 
50-500 mM imidazole gradient. Purified (h)E7 was dialyzed against DPBS/25% (v/v) 
glycerol. 

Example 5: Production of Recombinant HPV 16 E7 - M. bovis BCG 65 Fusion 
Protein (HspE7) 

A plasmid encoding HspE7 was constructed as follows. The M bovis BCG 
Hsp65 coding sequence was PCR ampUfied from pRIB 1300 using the same forward 
primer (w046) as for pET65. The reverse primer (w076: 5' CGC TCG GAC GCT 
AGC TCA CAT ATG GAA ATC CAT GCC 3'; SEQ ID NO:13) contains an Ndel site 
upstream and an Nhel site downstream of a TGA stop codon. The PCR product was 
digested with Ncol and Nhel, purified and ligated to pET28a which had been cut with 
Ncol and Nhel. 

The HPV16 E7 coding sequence was PCR amplified from HPV16 genomic 
DNA (pSK/HPV16) using the following primers. The forward primer (wl51: 5' CCA 
GCT GTA CAT ATG CAT GGA GAT 3'; SEQ ID NO:14) contains an ATG start 
codon at an Ndel site. The reverse primer (wl34: 5* AGC CAT GAA TTC TTA TGG 
TTT CTG 3'; SEQ ID NO: 15) contains an EcoRI site downstream of a TAA stop 
codon. The PCR product was digested with Ndel and EcoRI, purified and ligated to 
pET65C which had been cut with Nde I and EcoRI and the resulting plasmid 
(pET65C/E7-lN) was transformed into E. coli BL21(DE3) cells. pET65C/E7-lN 
encodes a fusion protein consisting of Hsp65 linked via its C-terminus to HPV 16 E7, 
abbreviated HspE7. The nucleotide sequence (SEQ ID NO: 1 6) coding for expression 
of HspE7 (SEQ ID NO: 17) is shown in Figs. 4A-4B. 
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The HspE7 protein was purified as follows. The Soluble Fraction was prepared 
as described above from E. co//BL21(DE3) cells transformed with plasmid 
pET65C/E7-lN. Hsp65-HPV16 E7 fusion protein (HspE7) present in the Soluble 
Fraction was purified by the following chromatographic steps: 0-15% ammonium 
5 sulfate precipitation, Ni-chelating Sepharose (100 ml column, Amersham Pharmacia) 
and Q-Sepharose (100 ml column, Amersham Pharmacia). Endotoxin was removed by 
extensive washing with 1% (v/v) Triton X-100 on a Ni-chelating Sepharose column in 
the presence of 6M guanidine-HCl (Gu-HCl). Purified HspE7 was exchanged into 
DPBS/15% (v/v) glycerol and stored at -70°C. 

10 

Example 6: Production of Recombinant M tuberculosis Hsp40 - HPV 16 E7 Fusion 
Protein (MT40-E7) . 

pETMT40E7 is a plasmid encoding chimeric recombinant protein MT40E7 
composed of Mycobacterium tuberculosis (strain H37RV - ATCC 27294) hsp40 

1 5 protein with hu HPV 1 6 (ATCC 45 1 1 3 ) E7 protein attached at the C-terminus of 
Hsp40. The plasmid was transformed into E. coli BL21(DE3) cells for protein 
production and purification. The nucleotide sequence (SEQ ID NO: 18) coding for 
expression of MT40-E7 (SEQ ID NO: 19) is shown in Figs. 5A-5B. 

The MT40-E7 protein was purified as follows. The Inclusion Body fraction 

20 was prepared as described above from E. coli BL21(DE3) cells transformed with 
plasmid pETMT40E7. MT40-E7 protein was purified using the following 
chromatographic steps: Q-Sepharose (100 ml column, Amersham Pharmacia), Ni- 
chelating Sepharose (70 ml, Amersham Pharmacia) under native conditions with serial 
washings containing 2% (v/v) Triton X-100 followed by serial washing to remove 

25 residual Triton X-100. The purity of the protein was checked by SDS-PAGE, the 
appropriate fractions pooled and dialyzed overnight at 4°C against DPBS/25% (v/v) 
glycerol. 

Example 7: Ovalbumin (OVA) 
30 Ovalbumin (Lot # 37H701 0) was purchased from Sigma Chemicals and purified 

by chromatography using 20 mL of Con A Sepharose (Amersham-Pharmacia), 
Fractions containing the purified product were pooled and dialyzed overnight against 
DPBS. 
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Example 8: Production of Recombinant M. bovis BCG Hsp65-Ovalbumin Fusion 
Protein (HspOva) 

A plasmid encoding HspOva was constructed as follows. The full length 
5 chicken ovaibumin-coding sequence was excised from pET/OVA with Nhe I and EcoR 
I digestion and purified from an agarose gel. The sequence coding for expression of 
OVA is shown in Fig. 6. The purified product was ligated to pET65H previously 
digested with the same enzymes. The ligation reaction was used to transform E. coli 
DHSalpha and putative clones containing the chicken ovalbumin gene insert were 

10 selected based on diagnostic restriction digestion. This initial restriction analysis was 
confirmed by DNA sequence analysis of the entire fusion gene, promoter and 
termination regions. DNA of the confirmed construct, named pET65H/OVA, was used 
to transform E. coli BL21(DE3). The nucleotide sequence (SEQ ID NO:20) coding for 
expression of HspOVA (SEQ ED NO:21) is shown in Figs. 7A-7C. 

15 The HspOva protein was purified as follows. The Inclusion Body fraction was 

prepared as described above from K coli BL21(DE3) cells transformed with plasmid 
pET65H/OVA. The HspOva fusion protein present in the Inclusion Body fraction was 
purified using the following chromatographic steps: Q-Sepharose (100 ml column, 
Amersham Pharmacia) and Ni-chelating Sepharose (60 ml, Amersham Pharmacia) 

20 under denaturing conditions with serial washings containing 2% (v/v) Triton X-l 00 
followed by serial washing to remove residual Triton X-l 00. The purity of the protein 
was checked by SDS-PAGE, the appropriate fractions pooled and dialyzed overnight at 
4°C against DPBS/15% (v/v) glycerol, followed by a dialysis against DPBS/2.5 %(w/v) 
sucrose. 

25 

Example 9: Production of Recombinant Glutathione-S-Transferase (GST) 

A plasmid encoding Gst was constructed as follows. The kanamycin resistance- 
coding sequence was excised from pET28a DNA with AlwN I and Xho I digestion and 
purified from an agarose gel. The purified product was ligated to pGEX-4T-2 that had 
30 been previously digested with the same enzymes. The ligation reaction was used to 
transform E. coli DHSalpha and putative clones containing the kanamycin resistance 
gene insert were selected based on diagnostic restriction digestion. This initial 
restriction analysis was confirmed by DNA sequence analysis of the entire insert 
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coding sequence, promoter and termination regions. DNA of the confirmed construct, 
named pGEX/K, was used to transform E. coli strain BL21(DE3). The nucleotide 
sequence (SEQ ID NO:22) coding for expression of GST (SEQ ID NO:23) is shown in 
Fig. 8. 

5 The GST protein was purified as follows. The Soluble fraction was prepared as 

described above from E. coli BL21(DE3) cells transformed with plasmid pGEX/K. 
The GST protein present in the Soluble Fraction was purified by Glutathione- Agarose 
Chromatography as follows. Approximately 20 mL of Glutathione-Agarose (Sigma- 
Aldrich; Cat. #: G4510) was equilibrated with DPBS, and mixed and incubated 

10 overnight with the sample at room temperature on a shaker. The next morning, the 
resin was packed into a column and serially washed with DPBS. Endotoxin was 
removed by washing with 2% (v/v) Triton X-100 followed by serial washing to remove 
residual Triton X-100. Finally, the protein was eluted using 10 mM glutathione 
(reduced form), 50 mM TRIS HC1, pH 8.0. 

15 

Example 10: Production of Recombinant Glutathione-S-Transferase - HPV 16 E7 

Fusion Protein (GST-ET) 

A plasmid encoding GST-E7 was constructed as follows. The HPV16 E7 

coding sequence was excised from pETOVA/E7 with BamH I and EcoR I digestion 
20 and purified from an agarose gel. The purified product was ligated to pGEX/K that had 

been previously digested with the same enzymes. The ligation reaction was used to 

transform E. coli DH5 alpha and putative clones containing the HPV16-E7 gene insert 

were selected based on diagnostic restriction digestion. This initial restriction analysis 

was confirmed by DNA sequence analysis of entire fusion gene, promoter and 
25 termination regions. DNA of the confirmed construct, named pGEX/K/E7, was used to 

transform E. coli strain BL21(DE3). The nucleotide sequence (SEQ ID NO:24) coding 

for expression of GST-E7 (SEQ ID NO:25) is shown in Fig. 9. 

The GST-E7 protein was purified as follows. Bacteria containing the 

expression vector pGEX/K/E7 were grown and the protein purified using the affinity 
30 chromatography procedure essentially as described above for GST. 
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Example 1 1 : Production of Recombinant HPV 1 6 E7 - Linker - M. bovis BCG Hsp65 
Fusion Protein (E7-L-BCG65) 

A plasmid encoding E7-L-BCG65 was constructed as follows. The HPV16 E7- 
coding sequence was PCR-amplified from pSK/HPV16 (ATCC) using primers w280 
5 and w396 (w280: CCA GCT GTA ACC ATG GAT GGA GAT (SEQ ID NO:26) and 
w396: GCC ATG GTA CTA GTT GGT TTC TGA GAA(SEQ ID NQ27:)). The PCR 
product was digested with restriction enzyme Nco I and Spe I and purified from an 
agarose gel. The purified PCR product was ligated to pET5'65 (pET5'65 is pET65 with 
a polyglycine linker sequence inserted at the 5 f aid of the M. bovis BCG hsp65 

1 0 sequence) that had been previously digested with the same enzymes. The ligation 
reaction was used to transform E. coli DH5alpha and putative clones containing the 
HPV16 E7 gene insert were selected based on diagnostic restriction digestion. This 
initial restriction analysis was confirmed by DNA sequence analysis of entire fusion 
gene, promoter and termination regions. DNA of confirmed construct, named 

1 5 pET/E7/5 > 65, was used to transform E. coli strain BLR(DE3). The nucleotide sequence 
(SEQ ID NO:28) coding for expression of E7-L-BCG65 (SEQ ID NO:29) is shown in 
Figs. 10A-10B. 

The E7-L-BCG65 protein was purified as follows. The Soluble Fraction was 
prepared as described above from E. coli BLR(DE3) cells transformed with plasmid 

20 pET/E7/5'65. The E7-L-BCG65 fusion protein present in the Soluble Fraction was 
purified using the following chromatographic steps: Butyl Sepharose (100 ml, 
Amersham-Pharmacia), Q-Sepharose (100 ml column, Amersham Pharmacia), 
Superdex 200 Gel Filtration (26/60 column, Amersham Pharmacia), and Ni-chelating 
Sepharose Fast Flow Chromotography (60 ml, Amersham Pharmacia) under denaturing 

25 conditions with serial washings containing 2% (v/v) Triton X-l 00 followed by serial 
washing to remove residual Triton X-l 00. The purity of the protein was checked by 
SDS-PAGE, the appropriate fractions pooled and dialyzed overnight at 4°C against 
DPBS. In order to reduce the amount of endotoxin contained in the sample, it was 
further purified using a pre-packed 1 ml column of DetoxiGel™ (Pierce, Rockford, IL, 

30 USA) according to the manufacturer's instructions. 
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Example 12: Production of Recombinant HPV 16 E7-M. bovis BCG Hsp65 
Fragment Fusion Protein (BCG65(F1)-E7) 

A plasmid encoding BCG65(F1)-E7 was constructed as follows. The first 600 
amino terminal base pairs of M. bovis BCG hsp65 gene were PCR-amplified from 
5 pET65C/E7-lN using primers w046 and w293 (w046: TTC GCC ATG GCC AAG 
ACA ATT GCG (SEQ ID NO:30) and w293: GTA CCC CGA CAT ATG GCC CTT 
GTC GAA CCG CAT AC(SEQ ED NO:31)). The PCR product was digested with the 
restriction enzymes Nco I and Nde I and purified from an agarose gel. The purified 
PCR product was ligated to pET65C/E7-lN that had been previously digested with the 

10 same enzymes. The ligation reaction was used to transform E. coli DH5 alpha and 

putative clones containing the truncated BCG65 gene were selected based on diagnostic 
restriction digestion. This initial restriction analysis was confirmed by DNA sequence 
analysis of the entire fusion gene, promoter and termination regions. The. confirmed 
plasmid construct, named pET65Fl/E7, was used to transform E. coli strain 

15 BLR(DE3). The nucleotide sequence (SEQ ID NO:32) coding for expression of 
BCG65(F1)-E7 (SEQ ID NO:33) is shown in Fig. 11. 

The BCG65(F1)-E7 protein was purified as follows. The Inclusion Body 
fraction was prepared as described above from E. coli BLR(DE3) cells transformed 
with plasmid pET65Fl/E7. The BCG65(F1)-E7 fusion protein present in the Inclusion 

20 Body fraction was purified using the following chromatographic steps: Source 15Q 
Sepharose (Amersham-Pharmacia) and Ni-chelating Sepharose (60 ml, Amersham 
Pharmacia) under denaturing conditions with serial washings containing 2% (v/v) 
Triton X-100 followed by serial washing to remove residual Triton X-100. The purity 
of the protein was checked by SDS-PAGE, the appropriate fractions pooled and 

25 dialyzed overnight at 4°C against DPBS. 

Example 13 : Production of Recombinant M. tuberculosis HsplO - HPV 16 E7 Fusion 
Protein (TB10-E7) 

Expression plasmid pETESE7 contains a chimeric gene composed of the 
30 Mycobacterium tuberculosis strain H37RV (ATCC 27294 ) groES (hsplO) coding 
sequence fused, at its 3' end, to the HPV1 6 (ATCC 45 1 13 ) E7 coding. The chimeric 
gene was cloned into expression vector pET28a and transformed into E. coli 
BL21(DE3) cells for protein production and purification. The nucleotide sequence 
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(SEQ ID NO:34) coding for expression of TB10-E7 (SEQ ID NO:35) is shown in Fig. 
12. 

The TBI 0-E7 protein was purified as follows. The Inclusion Body fraction was 
prepared as described above from E. coli BL21(DE3) cells transformed with plasmid 

5 pETESE7. The TB 1 0-E7 fusion protein present in the Inclusion Body fraction was 
purified using the following chromatographic steps: DEAE Sepharose (100 ml column, 
Amersham Pharmacia), Source 15Q Sepharose (100 ml column, Araersham Pharmacia) 
and Ni-chelating Sepharose (60 ml, Amersham Pharmacia) under denaturing conditions 
with serial washings containing 2% (v/v) Triton X-100 followed by serial washing to 

10 remove residual Triton X-100. The purity of the protein was checked by SDS-PAGE, 
the appropriate fractions pooled and dialyzed overnight at 4°C against DPBS/10 %(v/v) 
glycerol. 

Example 14: Production of Recombinant HPV 16 E7 - M, tuberculosis Hsp71 Fusion 

15 Protein (E7-TB71) 

A plasmid encoding E7-TB71 was constructed as follows. The M tuberculosis 
hsp71 gene was PCR-amplified from clone pY31 1 1/8 (Mehlert and Young (1989) 
MoLMicrobiol. 3:125-130) using primers w048 and w079 (w048: 5'-TTC ACC ATG 
GCT CGT GCG GTC GGG (SEQ ID NO:36) and w079: ACC TCC GCG TCC ACA 

20 GCT AGC TCA GCC(SEQ ID NO:37)). The PGR product was digested with Nco I 
and Nhe I, gel-purified and ligated to pET28a digested with the same enzymes to 
generate pET/7l. 

The HPV 16 E7-coding sequence was PCR-amplified from pSK/HPV16 
(ATCC) using primers w280 and w344 (w280: CCA GCT GTA ACC ATG GAT GGA 

25 GAT (SEQ ID NO:38) and w344: GGA TCA GAC ATG GCC ATG GCT GGT TTC 
TG (SEQ ID NO:39)). The PCR product was digested with restriction enzyme Nco I 
and purified from an agarose gel. The purified PCR product was ligated to pET/7 1 
DNA that had been previously digested with Nco I and CIAP to remove 5' phosphate. 
The ligation reaction was used to transform E. coli DHSalpha and putative clones 

30 containing the HPV1 6 E7 gene insert were selected based on diagnostic restriction 

digestion. This initial restriction analysis was confirmed by DNA sequence analysis of 
entire fusion gene, promoter and termination regions. The confirmed construct, named 
pET/E7/71, was used to transform E. coli strain BL21(DE3). The nucleotide sequence 
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(SEQ ID NO:40) coding for expression of E7-TB71 (SEQ ED NO:41) is shown in Figs. 
13A-13B. The resulting construct, pET/E7/71, was further modified (to complete 
sequences at the 3' end of the hsp71 gene) by replacement of a Kpn I to Nhe I fragment 
containing sequences from the 3' end of the hsp71 gene by a Kpn I- and Nhe I-digested 

5 PCR fragment amplified from pY3 1 1 1/8 using primers w391 and w392 (w391 : GAG 
GGT GGT TCG AAG GTA CC (SEQ ID NO:42) and w392: TTT GAT TTC GCT 
AGC TCA CTT GGC CTC(SEQ ED NO:43)). The resulting final plasmid, 
pET/E7/7r, expresses HPV16 E7 fused to the amino-terminus of full-length Hsp71 
protein and was used to transform E. coli strain BL2 1(DE3). The nucleotide sequence 

10 (SEQ ID NO:44) coding for expression of the fusion protein (SEQ ID NO:45) of 
pET/E7/71 ' is shown in Figs. 14A-14B. 

The E7-TB71 protein was purified as follows. The Inclusion Body fraction was 
prepared as described above from E. coli BL21(DE3) cells transformed with plasmid 
pET/E7/71\ The E7-TB71 fusion protein present in the Inclusion Body fraction was 

1 5 purified using the following chromatographic steps: Q-Sepharose (100 ml column, 
Amersham Pharmacia) and Ni-chelating Sepharose (80 ml, Amersham Pharmacia) 
under native conditions with serial washings containing 2% (v/v) Triton X-100 
followed by serial washing to remove residual Triton X-100. The purity of the protein 
was checked by SDS-PAGE, the appropriate fractions pooled and dialyzed overnight at 

20 4°C against DPBS/1 0 %(v/v) glycerol. 

Example 1 5 : Production of Recombinant Streptococcus pneumoniae HSP65 ( 2) - 
HPV 16 E7 Fusion Protein (SP65(2)-E7) 

A plasmid encoding SP65(2)-E7 was constructed as follows. The Streptococcus 
25 pneumoniae hsp65 gene was PCR-amplified from plasmid pETP60-2 (PCT patent 
application WO 99/35720) using primers w384 and w385 (w384: GCA GCC CCA 
TGG CAA AAG AAA (SEQ ID NO:46) and w385: GCT CGA ATT CGG TCA GCT 
AGC TCC GCC CAT (SEQ ID NO:47)). The PCR product was digested with Nco I 
and EcoR 1, gel-purified and ligated to pET28a digested with the same enzymes to 
30 generate pET/SP65-2C. 

The HPV16 E7-coding sequence was PCR-amplified from pSK/HPV16 
(ATCC) using primers wl33 and wl34 (wl33: AAC CCA GCT GCT AGC ATG CAT 
GGA GAT (SEQ ID NO:48) and wl34: AGC CAT GAA TTC TTA TGG TTT CTG 



31 



WO 01/04344 



PCT/US00/18828 



(SEQ ID NO:49)). The PCR product was digested with restriction enzymes Nhe I and 
EcoR I and purified from an agarose gel. The purified PCR product was then ligated to 
pET/SP65-2C that had been previously digested with Nhe I and EcoR I. The ligation 
reaction was used to transform E. coli DH5alpha and putative clones containing the 
5 HPV16 E7 insert were selected based on diagnostic restriction digestion. This initial 
restriction analysis was confirmed by DNA sequence analysis of entire fusion gene, 
promoter and termination regions. DNA of the confirmed construct, named 
pET/SP65c-E7, was used to transform £*. coli strain BLR(DE3). The nucleotide 
sequence (SEQ ID NO:50) coding for expression of SP65(2)-E7 (SEQ ID NO:51) is 

10 shown in Figs. 15A-15B. 

The SP65(2)-E7 protein was purified as follows. The Inclusion Body fraction 
was prepared as described above from E. coli BLR(DE3) cells transformed with 
plasmid pET/SP65c-E7. The SP65(2)-E7 fusion protein present in the Inclusion Body 
fraction was purified using the following chromatographic steps: Q-Sepharose (100 ml 

15 column, Amersham Pharmacia) and Ni-chelating (60 ml, Amersham Pharmacia) under 
denaturing conditions with serial washings containing 2% (v/v) Triton X-100 followed 
by serial washing to remove residual Triton X-100. The purity of the protein was 
checked by SDS-PAGE, the appropriate fractions pooled and dialyzed overnight at 4°C 
against DPBS. 

20 

Example 16: Recombinant Production of Aspergillus fumizatus Hsp60- HPV 16 E7 
Fusion Protein (AF60-E7) 

pETAF60E7 is a plasmid encoding a recombinant protein, AF60-E7, composed 
of the Aspergillus fumigatus (ATCC 26933) Hsp60 protein (without leader) (obtained 
25 as described in PCT/CA99/01 152) fused at its C-terminus to the HPV16 (ATCC 45 1 13) 
E7 protein sequence. Plasmid pETAF60E7 was used to transform E. coli BL21(DE3) 
cells for protein production and purification. The nucleotide sequence (SEQ ID 
NO:52) coding for expression of AF60-E7 (SEQ ID NO:53) is shown in Figs. 16A- 
16B. 

30 The AF60-E7 protein was purified as follows. The Inclusion Body fraction was 

prepared as described above from £. coli BL21(DE3) cells transformed with plasmid 
pETAF60E7. AF60-E7 protein was purified using the following chromatographic 
steps: Source 15Q Sepharose (Amersham-Pharmacia) and Ni-chelating Sepharose (60 
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ml, Amersham Pharmacia) under denaturing conditions with serial washings containing 
2% (v/v) Triton X-100 followed by serial washing to remove residual Triton X-100. 
The purity of the protein was checked by SDS-PAGE, the appropriate fractions pooled 
and dialyzed overnight at 4°C against DPBS. 

5 

Example 17: Stimulation of IFN-Gamma Release by a Hsp65-HPVE7 (HsnE7) 
Fusion Protein 

Pooled, unfractionated splenocytes were prepared from untreated naive 
C57BL/6 mice obtained from two different sources (Charles River Laboratory and 

10 Jackson Laboratory) and were plated in complete medium (complete RPMI) at 6 x 10 5 
cells/well in flat bottom 96-well tissue culture plates. Replicate cultures (5) were 
incubated for 72 hours with 0.05 to 1.4 nmol/mL concentrations of recombinant M. 
bovis BCG Hsp65 (Hsp65), HPV16 E7 (E7) or histidine-tagged E7 ((h)E7), an 
admixture of M bovis BCG Hsp65 and HPV16 E7 (Hsp65 + E7), or M bovis BCG 

1 5 Hsp65 - HPV1 6 E7 fusion protein (HspE7). Subsequent to incubation, cells were 
pelleted, and supernatants were transferred to IFN-gamma capture ELISA plates. 

After incubation, the replicate samples were harvested, pooled in eppendorf 
tubes and pelleted at 1200 rpm for 7 minutes in Beckman GS-6R centrifuge (300 x g). 
The supernatants were removed into cryovials and frozen at -70°C until time of 

20 analysis. 

Maxisorp ELISA plates (Nunc cat# 442404A) were coated overnight at 4°C 
with 1 fig/mL purified rat anti-mouse IFN-gamma (PharMingen cat. no 1818 ID) in 
0.1 M NaHC0 3 buffer, pH 8.2. The plates were washed with 0.05% Tween 20 in PBS 
then blocked with 3% BSA (albumin fraction V; Amersham cat. no 10857) in DPBS 

25 (blocking buffer) for 2 hours. After the plates were washed, recombinant mouse IFN- 
gamma (8000, 4000, 2000, 1000, 500, 250, 125, 62.5 pg/mL in complete RMPI) was 
placed in triplicate onto each ELISA plate. Sample supernatants were removed from - 
70°C, thawed quickly at 37°C, and placed undiluted onto the ELISA plates in duplicate. 
The samples were then serially diluted by seven, 3 -fold dilutions in complete RPMI 

30 followed by incubation at 4°C overnight. Background ELISA values were established 
by measuring eight wells containing all reagents except the target antigen. 

Detection of bound murine IFN-gamma was accomplished using 1 ng/mL of a 
rat anti-mouse IFN-gamma biotin conjugate (PharMingen cat. no 181 12D) in blocking ' 
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buffer. Following washing, bound biotin-conjugated antibody was detected using a 
1:1000 dilution of a streptavidin-alkaline phosphatase conjugate (Caltag cat. no 
SA1008). The plates were washed as before followed by the addition of a chromogenic 
substrate, p-nitrophenyl phosphate (pNPP; Sigma cat# N-2765) at 1 mg/mL in 
5 diethanolamine buffer, pH 9.5. After 30 minutes incubation, the color reaction was 
stopped using 50 (iL of 100 mM EDTA, pH 8.0. The absorbance was measured at 410 
nm using a Dynatech MR5000 ELISA plate reader equipped with Biolinx 2.0 software. 
The levels of IFN-gamma detected in test samples were extrapolated from the standard 
curves generated on each of the respective ELISA plates. Data is expressed as IFN- 

1 0 gamma released (pg/mL ± SD). 

Results of assays are shown in Figs. 17A-17B. The averages from five 
replicates are shown along with the standard deviation. Substantial secretion of IFN- 
gamma was elicited by exposure of splenocytes to 0.05, 0.15, 0.46 and 1.4 nmol/mL 
HspE7. Hsp65 alone, E7 alone, hE7 alone, and an admixture of Hsp65 and E7 were 

1 5 virtually incapable of stimulating IFN-gamma release. Similar results were obtained 
with splenocytes prepared from mice obtained from the Charles River Laboratory (Fig. 
17A) and from the Jackson Laboratory (Fig. 17B). 

Example 18: Stimulation of IFN-Gamma Release by a HspE7 Fusion Protein in 
20 Splenocyte Cultures from Mice Having Different Genetic Backgrounds 

Experiments similar to those presented in Example 17 were carried out using 
splenocytes from mice (from Jackson Laboratory) of three different haplotypes: 
C57BL/6 (H-2 b ); Balb/c (H-2 d ); and C3HeB/FeJ (H-2 k ). The relative effects of the 
fusion protein on the different splenocyte preparations were similar, although there 
25 were differences in the absolute amounts of IFN-gamma released: the observed order 
being Balb/c (highest; Fig. 18A), C57BL/6 (intermediate; Fig. 18B), and C3HeB/FeJ 
(lowest; Fig. 18C). As in Example 17, substantially increased IFN-gamma release was 
induced by HspE7, but not by E7 alone, Hsp65 alone, or an admixture of E7 and 
Hsp65. 
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Example 19: Stimulation of IFN-Gamma Release by Fusion Proteins is Independent 
of the Nature of the Linked Antigen but Requires a Linked Stress Protein Moiety 

Experiments were performed as discussed under the previous examples. It was 
observed that stimulation of naive splenocytes by (h)E7 or Hsp65 (M. bovis BCG) 

5 produced negligible IFN-gamma release, but that fusion proteins containing E7 and 
Hsp65 (M bovis BCG) or Hsp40 (M tuberculosis) substantially enhanced IFN-gamma 
release (Fig. 19). Virtually no induction of IFN-gamma release was mediated by a 
fusion protein containing E7 and glutathione-S-transferase (GST). When a fusion 
protein including an ovalbumin fragment and an Hsp (M bovis BCG Hsp65) was 

10 tested, high levels of IFN-gamma release were detected. The IFN-gamma release 

mediated by the HspOVA fusion protein exceeded that resulting from addition of OVA 
alone to the cell culture. These results demonstrate that the induced release of IFN- 
gamma is not dependent on the presence of the E7 antigen in the fusion protein, but that 
other antigens fused to an Hsp can similarly enhance IFN-gamma production. 

15 

Example 20: Stimulation of IFN-Gamma Release by E7 Fusion Proteins Having 
Different Stress Protein Moieties 

Experiments were performed as discussed under the previous examples. 
HPV16 E7 was fused to different Hsps, i.e., M tuberculosis HsplO (TB10-E7), M. 
20 bovis BCG Hsp65 (HspE7), Streptococcus pneumoniae Hsp65 (2) (SP65(2)-E7), and 
Aspergillus fumigatus Hsp60 (AF60-E7). Furthermore, in two cases (E7-L-BCG65 and 
E7-TB71) the Hsp (M bovis BCG Hsp65 and M. tuberculosis Hsp71, respectively) was 
added to the carboxy terminus of the E7 antigen instead of to the amino terminus as in 
the other fusions. 

25 Additionally, one construct was tested, in which the E7 antigen was linked to 

the amino terminal one third (residues 1-200) of the M. bovis BCG Hsp65 sequence 
(BCG65(F1)-E7), rather than an intact Hsp. It was observed (Figs. 20A-20B) that 
stimulation of IFN-gamma release occurred upon exposure of splenocytes to all the 
different fusion proteins, although differences in the magnitude of the responses were 

30 noted. Thus, fusions containing different Hsps, including Hsp65 from different 

organisms as well as different types of Hsps, were capable of eliciting enhanced IFN- 
gamma release. Furthermore, fusions containing a stress protein at either the amino 
terminal end or at the carboxy terminal end of the E7 antigen were active. Finally, 
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BCG65(F1)-E7, containing amino acids 1-200 of M. bovis BCG Hsp65, induced IFN- 
gamma secretion in a manner similar to the full-length Hsp65 sequence (HspE7). 

Exam ple 21 : Stimulation of IFN-Gamma Release by HspE 7 Fusion Protein in Lymph 

5 Node Cell Cultures 

To test for their ability to induce IFN-gamma release, various concentrations of 
the HspE7 proteins (diluted to the desired starting concentration in complete medium, 
defined as RPMI 1640 with 10% fetal calf serum) were added as replicate samples (3 to 
5 replicates) to flat bottom 96-well tissue culture plates. For the cellular component of 

1 0 the assay, three inguinal lymph nodes were aseptically removed from untreated 

C57BL/6 mice and placed in 5 ml of Hank's balanced salt solution supplemented with 
5% fetal calf serum (medium). Following their transfer to a sterile 0.22 micron nylon 
mesh, a sterile syringe plunger was used to disperse the cells through the mesh. 
Medium was used to rinse the cells, yielding a pooled, unfractionated single cell 

1 5 suspension. Cells were washed once, resuspended in complete medium and added to 
wells at 6 x 10 5 cells/well, to a final volume of 0.2 ml. Cultures were exposed to the 
HspE7 protein in medium or to medium alone for 72 hours at 37°C in a 5% C0 2 
atmosphere. Following incubation, replicate cultures were pooled, cells pelleted by 
centrifugation and supematants either measured for IFN-gamma content by ELISA 

20 according to the procedure described in Example 17, or frozen immediately at -70°C 
for later analysis. 

Fig. 21 shows the results of the above experiment, comparing induction of IFN- 
gamma release by lymph node cells and by splenocytes. The fusion protein was found 
to elicit a release of IFN-gamma in both cell types. The IFN-gamma release elicited by 
25 the fusion protein greatly exceeded that induced by Hsp65 alone. 

Example 22: Regression of Pre-Established Tumors in vivo Induced by 
Administration of Hsp Fusion Proteins 

Human papilloma vims type 16 (HPV16) is an infectious agent associated with 
30 the induction of cervical cancer and its premalignant precursor, cervical intraepithelial 
neoplasia. The following experiments use Hsp - HPV16 E7 fusion proteins of the 
invention to target immune recognition as part of a strategy to eliminate HPV16 E7- 
expressing host cells. 
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The H-2 b murine epithelial cell-derived tumor line, TC-1 (co-transformed with 
HPV16 E6 and E7 and activated human Ha-nzs), was obtained from T.C. Wu of Johns 
Hopkins University (Baltimore, MD). The use of TC-1 cells in assays similar to those 
used herein is described in PCT patent application WO 99/07860. TC-1 was 
maintained in complete medium, consisting of: RPMI 1640 (ICN, cat no. 1260354) 
supplemented with 10% FBS (Hyclone, cat no. SH30071); 2 mM L-Glutamine (ICN, 
cat no. 16-801-49); 10 mM HEPES (ICN, cat no. 16-884-49); O.lmM MEM Non 
Essential Amino Acid Solution (Gibco BRL, cat no. 1 1 140-050); 1 mM MEM Sodium 
Pyruvate (Gibco BRL, cat no. 1 1360-070); 50 |iM 2-Mercaptoethanol (Sigma, cat no. 
M-7522); and 50 mcg/mL Gentamycin Sulfate (Gibco BRL, cat no. 15750-01 1). The 
medium was also supplemented with G418 (0.4 mg/mL active, Gibco BRL, cat no. 
1 1 81 1-023) and Hygromycin B (0.2 mg/mL active, Calbiochem, cat no. 400051). 

Since the TC-1 cell line was derived from a C57BL/6 mouse, this mouse strain 
was used as the host in these experiments. Female C57BL/6 mice of approximately 8 
to 10 weeks of age were purchased from Charles River Canada (St-Constant, Quebec, 
Canada) and housed using filter top cages (four animals per cage). 

TC-1 cells were prepared for implantation as follows. TC-1 cells were seeded 
at a density of 2 - 5x1 0 4 cells /mL and incubated for two to four days until 70 to 90% 
confluent. Cells were trypsinized using a 30 second exposure to 0.25% Trypsin (lOx 
stock, Gibco cat. no. 1505-065, diluted to lx with DPBS), then diluted four-fold with 
supplemented complete medium. Following trypsinization, TC-1 cells were pelleted at 
4°C at 1000 rpm (250x g) for 4 minutes, the supernatant removed by aspiration and 
30 mL of cold DPBS added. The cells were then pelleted at 4°C at 700 rpm (lOOx g) 
for 4 minutes, the supernatant removed by aspiration, and a minimal amount (approx. 
5 mL) of cold DPBS added. The final cell density for injection was adjusted to 6.5 x 
10 5 viable cells per mL, as measured by the trypan blue dye exclusion method. At least 
90% of the cells used for TC-1 inoculations were viable. The cells were stored on ice 
for immediate injection into mice. 

TC-1 cells were implanted as follows. Between 24 to 72 hours prior to 
implantation, the hind flank of each mouse was shaved. TC-1 cells were prepared as 
described above and held on ice until injected. All injections were performed within 
two hours of cell trypsinization. The cells were swirled gently in the centrifuge tube 
and drawn into a 1 mL syringe (Becton-Dickinson, cat. no. 309602) without a needle. 
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A 25 gauge needle (Becton-Dickinson, cat. no. 305122) was then attached and any air 
bubbles were expelled. The shaved skin was raised gently and the needle was inserted 
bevel side up just beneath the skin surface. Cells (1 .3 x 1 0 5 ) were injected in a 0.2 mL 
volume for all studies. A fresh syringe and needle was used for every fifth injection. 

Fusion proteins were injected as follows. On treatment days, the fusion proteins 
HspE7, SP65(2)-E7, AF60-E7, E7-TB71 (shown if Figs. 23A and 23B as E7-MT71), 
MT40-E7 and TB10-E7 (prepared as described above) were removed from -70°C 
storage and thawed in a 37°C water bath. Dulbecco's phosphate buffered saline 
(DPBS) (4°C) was added to obtain the protein concentration desired for injection. The 
diluted fusion protein was held on ice until drawn into a 1 mL syringe (Becton- 
Dickinson, cat no. 309602) with a 30 gauge needle (Becton-Dickinson, cat no. 
3095 106). The same syringe was used to inject 0.2 mL of fusion protein into each 
mouse within a dose group; the syringe was refitted with a fresh needle for every fifth 
injection. Mice were injected subcutaneously in the scruff of the neck, as high on the 
neck as possible. 

Tumor incidence (TI) was measured as follows. TI was generally recorded 
three times per week, beginning eight days after tumor implantation and continuing for 
eight weeks. Mice were assessed for the presence or absence of subcutaneous tumor by 
palpation and visual observation of the tumor injection site. 

Tumor volume was measured as follows. Volumes of palpable subcutaneous 
tumor nodules were measured beginning on approximately Day 8 post implantation. 
The two longest orthogonal dimensions were measured using a Fowler Sylvac Ultra- 
Cal Mark III digital caliper with computerized data collection. Data points were 
tabulated in a Microsoft Excel spreadsheet. Tumor nodule measurements were 
extrapolated to mm 3 using the formula V = W 2 x L x 0.5 (where V represents volume, 
W represents width and L represents length) and are presented as average tumor 
volume ± standard error of the mean. The Student's t test function of Excel (two-tailed, 
unpaired samples, equal variances) was used to test the significance (p < 0.05) of the 
difference of the means of tumor volumes in each group. 

Seven different HPV16 E7 fusion proteins linked to various hsps were tested for 
their ability to regress a tumor in vivo. 

In the first experiment, C57BL/6 mice (18 per group) were inoculated 
subcutaneously with 1 .3 x 10 5 TC-1 cells in the right hind flank (Day 0). After 7 days, 
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groups of mice were treated with 0.2 mL of either DPBS (saline), 1 15 ug HspE7, 100 
ug SP65(2)-E7, or 100 ug AF60-E7. The doses of the two latter proteins were chosen 
based on the same molar equivalent of E7 contained in HspE7. The mice were 
monitored for the presence or absence of tumor in addition to tumor volume. The data 
are represented as percent tumor incidence (TI) per group (Fig. 22A) and tumor 
volume, expressed as average tumor volume ± standard error of the mean (Fig. 22B). 

As indicated in Fig. 22A, the majority of animals had detectable tumor by Day 8 
post implantation and by Day 13 tumor was evident in 94 to 100% of the mice. After 
this timepoint, TI in all of the mice declined until day 25 when the incidence for the 
DPBS-treated animals stabilized to approximately 50% for the remainder of the 
observation period. In contrast, the animals treated with fusion proteins showed a 
comparatively sharp decline in TI until day 28, when none of the animals had 
detectable tumor. This complete absence of tumor was observed for the remainder of 
the observation period for most of these animals. The complete regression of tumor in 
the animals treated with the fusion proteins was also clearly seen when measured by 
tumor volume. Figure 22B shows that by day 28, the average tumor volume of the 
animals treated with the fusion proteins was not detectable. By comparison, the 
average tumor volume of those animals treated with DPBS rose steadily from day 25 
onwards. 

In the second experiment, C57BL/6 mice (18 per group) were inoculated 
subcutaneously with 1.3 x 10 5 TC-1 cells in the right hind flank (Day 0). After 7 days, 
groups of mice were treated with 0.2 mL of either DPBS (saline), 100 ug HspE7, 100 
ug MT40-E7, 100 ug E7-TB71 (shown if Figs. 23A and 23B as E7-MT71), or 100 ug 
TB 1 0-E7. The mice were monitored for the presence or absence of tumor in addition to 
tumor volume. The data are represented as percent tumor incidence (TI) per group 
(Fig. 23A) and tumor volume, expressed as average tumor volume ± standard error of 
the mean (Fig. 23 B). 

As in Figure 22A, a majority (approximately 95%) of the animals had visible 
and palpable tumors on day 8 post tumor implantation (Fig. 23 A). By day 19, a 
decrease in TI was apparent. Following this, a sharp decrease in TI for all of the fusion 
protein-treated animals was observed such that by day 33, practically all of the animals 
were tumor-free. In contrast, the TI of the mice treated with DPBS had stabilized to 
approximately 75%. Fig. 23B shows the average tumor volumes of the mice treated 
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with the respective fusion proteins. The decrease in TI was reflected by the marked 
decrease in tumor volumes. Average tumor volumes for the animals treated with any of 
the fusion proteins was essentially not measurable by day 30. 
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What is claimed is: 

1 . A method of determining whether a fusion protein stimulates a Thl -like 
response, the method comprising: 

5 (a) providing a cell sample comprising naive lymphocytes in vitro; 

(b) providing a fusion protein comprising (i) a heat shock protein (Hsp) or a 
fragment thereof at least eight amino acid residues in length, fused to (ii) a 
heterologous polypeptide at least eight amino acid residues in length; 

(c) contacting the cell sample with the fusion protein; and 

I o (d) determining whether the fusion protein stimulates a Thl -like response in 

the cell sample. 

2. The method of claim 1 , wherein the Hsp is selected from the group 
consisting of Hsp65, Hsp40, HsplO,Hsp60, andHsp71. 

15 

3 . The method of claim 2, wherein the fusion protein comprises a 
polypeptide selected from the group consisting of Hsp65, Hsp40, Hspl 0, Hsp60, and 
Hsp71. 

20 4. The method of claim 1 , wherein the fusion protein comprises amino 

acids 1-200 of Hsp65 of Mycobacterium bovis. 

5 . The method of claim 1 , wherein the heterologous polypeptide comprises 
a sequence identical to at least eight consecutive amino acids of (i) a protein of a human 

25 pathogen or (ii) a tumor associated antigen. 

6. The method of claim 1 , wherein the heterologous polypeptide comprises 
a sequence identical to at least eight consecutive amino acids of a protein of a human 
virus. 

30 

7. The method of claim 6, wherein the virus is selected from the group 
consisting of human papillomavirus (HPV), herpes simplex virus (HSV), hepatitis B 
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virus (HBV), hepatitis C virus (HCV), cytomegalovirus (CMV), Epstein-Bair virus 
(EBV), influenza virus, measles virus, and human immunodeficiency virus (HIV). 

8. The method of claim 7, wherein the heterologous polypeptide comprises 
5 HPVE6. 

9. The method of claim 7, wherein the heterologous polypeptide comprises 
HPV E7. 

10 10. The method of claim 1 , wherein the heterologous polypeptide comprises 

HPV 16 E7 or a fragment thereof at least eight amino acid residues in length. 

1 1 . The method of claim 1 , wherein the heterologous polypeptide comprises 
HPV 16 E6 or a fragment thereof at least eight amino acid residues in length. 

15 

12. The method of claim 10, wherein the fusion protein comprises 
Mycobacterium bovis Hsp65 and HPV 16 E7. 

13 . The method of claim 1, wherein the cell sample comprises cells derived 
20 from a spleen, lymph node, peripheral blood, bone marrow, thymus, lung, respiratory 

tract, or anogenital mucosa 

14. The method of claim I, wherein the cell sample comprises splenocytes 
or lymph node cells. 

25 

15. The method of claim 1, wherein the detecting step comprises detecting 
IFN-gamma produced by the cell sample. 

1 6. The method of claim 1 , comprising the further steps of 

30 (e) providing a second cell sample comprising naive lymphocytes; 

(f) contacting the second cell sample with a second fusion protein; and 

(g) determining whether the second fusion protein stimulates a Thl-like 
response in the second cell sample, 
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wherein the first fusion protein comprises the sequence of a full-length, 
naturally occurring Hsp, and the second fusion protein comprises at least eight amino 
acids but less than all of the sequence of a naturally occurring Hsp. 

17. A method of screening a compound, the method comprising: 

(a) providing a cell sample comprising naive lymphocytes in vitro; 

(b) providing a fusion protein comprising (i) a Hsp or a fragment thereof at 
least eight amino acid residues in length, fused to (ii) a heterologous polypeptide at 
least eight amino acid residues in length; 

(c) contacting the cell sample with the compound and the fusion protein; 

and 

(d) determining whether the cell sample exhibits a Thl-like response 
following the contacting step, 

wherein a decrease in the Thl-like response in the presence of the compound 
compared to in the absence of the compound indicates that the compound inhibits a 
Thl-like response by the cell sample. 

1 8. The method of claim 17, wherein the determining step comprises 
detecting IFN-gamma produced by the cell sample. 

1 9. The method of claim 17, wherein the cell sample comprises cells derived 
from a spleen, lymph node, peripheral blood, bone marrow, thymus, lung, respiratory 
tract, or anogenital mucosa 

20. The method of claim 17, wherein the cell sample comprises splenocytes 
or lymph node cells. 

21 . The method of claim 17, wherein the Hsp is selected from the group 
consisting of Hsp65, Hsp40, Hsp 10, Hsp60, and Hsp71. 

22. The method of claim 2 1 , wherein the fusion protein comprises a 
polypeptide selected from the group consisting of Hsp65, Hsp40, HsplO, Hsp60, and 
Hsp71. 
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23. The method of claim 17, wherein the heterologous polypeptide 
comprises HPV E6. 

5 24. The method of claim 17, wherein the heterologous polypeptide 

comprises HPV E7. 

25. The method of claim 17, wherein the fusion protein comprises 
Mycobacterium bovis Hsp65 and HPV 16 E7. 

10 

26. A method of screening a compound, the method comprising: 

(a) providing a cell sample comprising naive lymphocytes in vitro; 

(b) providing a fusion protein comprising (i) a Hsp or a fragment thereof at 
least eight amino acid residues in length, fused to (ii) a heterologous polypeptide at 

1 5 least eight amino acid residues in length; 

(c) contacting the cell sample with the compound and the fusion protein; 

and 

(d) determining whether the cell sample exhibits a Th 1 -like response 
following the contacting step, 

20 wherein an increase in the Thl-like response in the presence of the compound 

compared to in the absence of the compound indicates that the compound promotes a 
Thl-like response by the cell sample. 

27. The method of claim 26, wherein the determining step comprises 
25 detecting IFN-gamma produced by the cell sample. 

28. The method of claim 26, wherein the cell sample comprises cells derived 
from a spleen, lymph node, peripheral blood, bone marrow, thymus, lung, respiratory 
tract, or anogenital mucosa 

30 

29. The method of claim 26, wherein the cell sample comprises splenocytes 
or lymph node cells. 
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30. The method of claim 26, wherein the Hsp is selected from the group 
consisting of Hsp65, Hsp40, HsplO, Hsp60, and Hsp71. 

31. The method of claim 30, wherein the fusion protein comprises a 

5 polypeptide selected from the group consisting of Hsp65, Hsp40, Hsp 1 0, Hsp60, and 
Hsp71. 

32. The method of claim 26, wherein the heterologous polypeptide 
comprises HPV E6. 

10 

33. The method of claim 26, wherein the heterologous polypeptide 
comprises HPV E7. 

34. The method of claim 26, wherein the fusion protein comprises 
1 5 Mycobacterium bovis BCG Hsp65 and HPV 16 E7. 

35. A method of determining whether a hybrid compound stimulates a Thl - 
like response, the method comprising: 

(a) providing a cell sample comprising naive lymphocytes in vitro; 
20 (b) providing a hybrid compound that is non-naturally occurring and 

comprises (i) a non-peptide compound having a molecular weight of less than 1,500, 
covalently linked to (ii) a polypeptide of at least eight amino acids in length, wherein 
the hybrid compound is made by covalently linking the non-peptide compound to the 
polypeptide; 

25 (c) contacting the cell sample with the hybrid compound; and 

(d) determining whether the hybrid compound stimulates a Thl -like 
response in the cell sample. 

36. The method of claim 35, wherein the non-peptide compound has a 
30 molecular weight of at least 100. 

37. A method of determining whether a hybrid compound stimulates a Thl - 
like response, the method comprising: 
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(a) producing a hybrid compound by covalently linking a non-peptide 
compound to a polypeptide of at least eight amino acids in length; 

(b) providing a cell sample comprising naive lymphocytes in vitro; 

(c) contacting the cell sample with the hybrid compound; and 

5 (d) determining whether the hybrid compound stimulates a Thi -like 

response in the cell sample. 

38. The method of claim 37, wherein the non-peptide compound has a 
molecular weight between 100 and 1,500. 

10 

39. A method of determining whether a fusion protein stimulates a Thl-like 
response, the method comprising: 

(a) providing a cell sample comprising naive lymphocytes in vitro; 

(b) providing a fusion protein comprising (i) a first polypeptide at least eight 
1 5 amino acids in length, fused to (ii) a second polypeptide at least eight amino acids in 

length; 

(c) contacting the cell sample with the fusion protein; and 

(d) detecting a Thl-like response exhibited by the cell sample following the 
contacting step. 

20 

40. The method of claim 39, wherein the detected Thl-like response is 
greater than a Thl-like response exhibited by a second cell sample comprising naive 
lymphocytes when the second cell sample is contacted with either the first polypeptide, 
the second polypeptide, or a mixture of the first polypeptide and the second 

25 polypeptide. 

41. The method of claim 40, wherein the detected Thl-like response is at 
least two times greater than the Thl-like response exhibited by the second cell sample. 

30 42. The method of claim 40, wherein the detected Thl-like response is at 

least five times greater than the Thl-like response exhibited by the second cell sample. 
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43. A fusion protein comprising (i) a Hsp 10 protein or a fragment thereof at 
least eight amino acid residues in length, and (ii) a heterologous polypeptide at least 
eight amino acids in length. 

5 44. The fusion protein of claim 43, comprising a HsplO protein. 

45 . The fusion protein of claim 44, wherein the Hsp 1 0 protein is a 
mycobacterial protein. 

10 46. The fusion protein of claim 45, comprising the Mycobacterium 

tuberculosis HsplO protein. 

47. The fusion protein of claim 43, wherein the heterologous polypeptide 
comprises a sequence identical to at least eight consecutive amino acids of a protein of 

15 a human virus. 

48. The fusion protein of claim 47, wherein the human virus is HPV. 

49. The fusion protein of claim 48, wherein the heterologous polypeptide 
20 comprises HP VI 6 E7. 

50. A fusion protein comprising (i) a Hsp40 protein or a fragment thereof at 
least eight amino acid residues in length, and (ii) a heterologous polypeptide at least 
eight amino acids in length. 

25 

5 1 . The fusion protein of claim 50, comprising a Hsp40 protein. 

52. The fusion protein of claim 51, wherein the Hsp40 protein is a 
mycobacterial protein. 

30 

53. The fusion protein of claim 52, comprising the Mycobacterium 
tuberculosis Hsp40 protein. 
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54. The fusion protein of claim 50, wherein the heterologous polypeptide 
comprises a sequence identical to at least eight consecutive amino acids of a protein of 
a human virus. 

5 55 . The fusion protein of claim 54, wherein the human virus is HPV. 

56. The fusion protein of claim 55, wherein the heterologous polypeptide 
comprises HPV16 E7. 

10 57. A fusion protein comprising (i) a Hsp71 protein or a fragment thereof at 

least eight amino acid residues in length, and (ii) a heterologous polypeptide at least 
eight amino acids in length. 

5 8 . The fusion protein of claim 5 7, comprising a Hsp7 1 protein. 

15 

59. The fusion protein of claim 58, wherein the Hsp71 protein is a 
mycobacterial protein. 

60. The fusion protein of claim 59, comprising the Mycobacterium 
20 tuberculosis Hsp71 protein. 

61 . The fusion protein of claim 57, wherein the heterologous polypeptide 
comprises a sequence identical to at least eight consecutive amino acids of a protein of 
a human virus. 

25 

62. The fusion protein of claim 61, wherein the human vims is HPV. 

63 . The fusion protein of claim 62, wherein the heterologous polypeptide 
comprises HPV 1 6 E7. 

30 

64. A method of determining whether a compound stimulates a Thl-like 
response, the method comprising: 

(a) providing a cell sample comprising naive lymphocytes in vitro; 
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(b) providing a compound; 

(c) contacting the cell sample with the compound; and 

(d) detecting a Thl-like response exhibited by the cell sample following the 
contacting step. 

5 
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1/37 



3/1 33/11 

at? gCC AAG ACA AIT QCG TAC GAC GAA GAG GCC CGT COC GGC CTC GAG CGG CCC TTG AAC 
maX ti XXYDB2AHRGLSRGLN 
63/21 " «/31 

GCC CTC GCC GAT GCG CTA AAG GTG ACA TIG GGC CCC AAG GGC CGC AAC GTC CTC CTG GAA 
AL ADAVKVTLGPKGRNVVLE 
123/41 153/51 

AAG AAG TGG COT GCC CCC ACG ATC ACC AAC GAT GGT GTG TCC ATC GCC AAG GAG ATC GAG 
KKW GAPTITNDGVSIAKE1E 
183/61 " 213/71 

CTO GAG GAT CCG TAC GAG aatf acc ggc GCC GAG CTG GTC AAA GAG OTA GCC AAG AAG ACC 
LEDPYEKXGAELVKEVAKKT 
243/81 273/91 

gat gac gic gcc ggt gac ggc acc acg acg gcc acc gtg ctg gcc cag ccg ttg gtt cgc 
ddvagdgtttatvlaqalvr 

303/101 333/111 

gAg GGC CTG CGC AAC GTC QCG GCC CGC GCC AAC CCG CTC GGT CTC AAA CGC GGC ATC GAA 
EGLRUVAAGANPLOLKRG1 E 
363/121 393/131 

AAG GCC GTG GAG AAG GTC ACC GAG ACC CTG CTC AAG GGC GCC AAG GAG GTC GAG ACC AAG 
KAVE XVTETLLXGAXEVETK 
423/141 453/151 

GAG CAG ATT GOG GCC ACC GCA GCG ATT TCG GCG GOT GAC CAG TCC ATC GOT GAC CTG ATC 
BQI AATAAISAGDQSIGDLI- 

483/151 513/171 

GCC GAG GCG ATG GAC AAG GTG GGC AAC GAG GGC GTC ATC ACC GTC GAG GAG TCC AAC ACC 
ABAMDKVGNEGVITVEESNT 
543/181 573/191 

TTT GGG CTG CAG CTC GAG CTC ACC GAG GGT ATG COG TTC GAC AAG GGC TAC ATC TOG GGG 
PG LQi.ELTEGMRFDKGYISG 
603/201 633/211 

tac ttc gtg acc gac ccg gag cot cag gag gcg gtc ctg gag gac ccc tac atc ctg ctg 
yfvtdperqeavledpyill 

663/221 693/231 

GTC ACC TCC AAG CTO TCC ACT GTC AAG GAT CTG CTG CCG CTG CTC GAG AAG GTC ATC GGA 
VSS KVSTVKDLI.PIil*EXVI G 
723/241 753/251 

GCC CGT AAG CCG CTG CTG ATC ATC GCC GAG GAC GTC GAG GGC GAG GOG CTG TCC ACC CTG 
AGKPLLIIAEDVEGEALSTL 
783/261 813/271 

GTC GTC AAC AAG ATC CGC GGC ACC TIC AAG TCG GTG GCG GTC AAG GCT CCC GGC TTC GGC 
V VU KIRGTFKSVAVKAPGFG 
843/201 873/291 

GAC CGC CGC AAG GCG ATG CTG CAG GAT ATG GCC ATT CTC ACC GOT GCT CAG GTG ATC AGC 
DRRKAMLQDMAILTGGQVIS 

903/301 933/311 

GAA GAG GTC GGC CTG ACG CTG GAG AAC GCC GAC CTG TCG CTG CTA GGC AAG GCC CGC AAG 
EE VG LTLEtfADLS LLGKARK 
963/321 993/331 

GTC GTG OTC ACC AAG GAC GAG ACC ACC ATC GTC GAG GGC GCC GGT GAC ACC GAC OCC ATC 
VVVT KD ETT"VEGAGDTDAl 
1023/341 1053/351 

GCC GGA CGA GTG GCC CAG ATC CGC CAG GAG ATC GAG AAC ACC GAC TCC GAC TAC GAC CGT 
AGRVAQXRQZIENSDSDYDR 
1083/361 1113/371 

GAG AAG CTG CAG GAG CGC CTG GCC AAG CTG GCC GGT GGT GTC GCG GTG ATC AAG GCC GGT 
EKLQERLAKLAGGVAVIKAG 

1143/381 1173/391 

GCC GCC ACC GAG CTC GAA CTC AAG GAG CGC AAG CAC CGC ATC GAG GAT GCG GTT CQC AAT 
AATEVELKERKHRIEDAVRN 

1203/401 1233/411 

GCC AAG GCC GCC GTC GAG GAG GGC ATC GTC GCC GGT GGG GGT GTG ACG CTG TTG CAA GCG 
AKAAVEEGIVAGGGVTLLQA 
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a/1 33/11 

ATG <JAT GGA GAT ACA CCT ACA TTG CAT GAA TAT ATG TTA GAT TTG CAA CCA GAG ACA ACT 

«3GDTPTLHEYMLDLQP£TT 
53/21 93/31 

gat ctc tac tot tat gag caa tta aat gac agc tca gag gag gag gat gaa ata gat got 
olycyeqlndsseeedei dg 

123/41 153/51 

CCA GCT GGA CAA GCA GAA CCG GAC AGA GCC CAT TAC AAT ATT GTA ACC TTT TGT TCC AAG 

PAGQAEPDRAHYNXVTFCCK 
183/61 213/71 

TGT GAC TCT ACG CTT CGG TTG TGC GTA CAA AGC ACA CAC GTA GAC ATT CGT ACT TTG GAA 

C DSTLRLCVQSTHVDIRTLE 
243/81 273/91 

GAC CTG TTA ATG GGC ACA CTA GGA ATT GTG TGC CCC ATC TOT TCT CAG AAA CCA TAA 
DLLMGTLGrVCPlCSQXP 
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ATO CCC AGC AGC CAT CAT CAT CAT CAT CAC AGC AGC GGC CTG GTG CCG CQC GGC AGC CAT 

MGS5BKK»HKSSGLV?RGSH 
63/21 93/31 

ATS gcc age ATS CAT GGA GAT ACA CCT ACA TTG CAT GAA TAT AlC TTA GAT TTC CAA CCA 
HASMHGDTPTLHEYMLDLQ? 
123/41 153/51 

GAG ACA ACT GAT CTC TAC TGT TAT GAG CAA TTA AAT GAC AGC TCA GAG GAG GAG GAT GAA 

ETTDLYCYEQLNDSSEEEDE 
183/61 213/71 

ATA GAT GGT CCA GCT GGA CAA GCA GAA CCG GAC AGA GCC CAT TAC AAT ATT OTA ACC TTT 

IDOPAGQAEPDRAHYNIVTP 
243/81 273/91 

TGT TGC AAG TGT GAC TCT ACG CTT CGG TTC TGC GTA CAA AGC ACA CAC OTA GAC ATT CGT 
CCKCDSTLRLCVQSTHVDIR 
303/101 333/111 

ACT TTG GAA GAC CTC TTA ATG GGC ACA CTA GGA ATT GTS TGC CCC ATC TGT TCT CAG AAA 

TLEDLLMGTLGIVCPICSOK 
363/121 
CCA TAA 
P * 
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atg gOC AAG ACA ATT GCG TAC GAC GAA GAG GCC CGT CGC GGC CTC GAG COG GGC TTQ AAC 

93/31 

GCC CTC GCC GAT GCG GTA AAG GTG ACA TTG GGC CCC AAG GGC CGC AAC GTC GTC CTG GAA 
ALADAVKLVTLGPKGRNVVLE 

153/51 

AAG AAG TGG GGT GCC CCC ACG ATC ACC AAC GAT GOT GTG ICC ATC GCC AAG GAG ATC GAG 
KKWGAPTITN 213/71 V S 

^GAG GAT CCG TAC GAG aag ate JSC GCC GAG CTG GTC AAA GAG GTA GCC AAC AAG ACC 
LEDPYEKICAELVKEVAKKT 

lAl/fll 273/91 

GAT GAC GTC GCC GGT GAC GGC ACC ACG ACG GCC ACC GTG CTG GCC GAG GCG TTG CTT CGC 
ODVAGDGTTTATVLAQALVK 

vw/ifll 333/lU 
gS'JS CTGCCCAACCTCGCGGCCGCCG<XAACCCGCTCG^ 

E GLRNVAAGANPLGLKRGI E 
tfit/vn 393/131 _ 

KAVEKVTETLLKGA KEVETR 
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££ CAG ATT GCG GCC ACC GCA GCG ATT ICG GCG GGT GAC CAG TCC ATC GGT GAC CTG ATC 

eqiaataaisagdqs I Q D u t 

Ifll/lfil 513/171 „ . - 

GCC GAG GCG ATG GAC AAG GTG GGC AAC GAG GGC GTC ATC ACC GTC GAS GAG TCC AAC ACC 
AEAMDKVGNE I 

TTt'oCG CTG CAG CTC GAG CTC ACC GAG GGT ATG CGG TIC GAC AAG GGC TAC ATC TOG GOG 
FGLQLELTSCMRrDKCYXSG 

Sc'tTC GTGACCGACCCGCAGCGTCAGGAG Sg'gS CTG GAG GAC CCC TAG ATC CTC CTG 
Y FVTDPERQEAVLEDPYILL 

t,, 693/231 

GTC AGC TCC AAG GTG TCC ACT GTC AAG GAT CTG CTG CCG CTG CTC GAG AAG GTC ATC GGA 
VSSXVSTVKDLLPt-LBKVI^ 

753/251. 

GCC^GGT AAG CCG CTG CIG ATC ATC GCC CAG GAC GTC GAG GGC GAG GCG CTG TCC ACC CTG 
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813/271 

gtc'gtc AACAAGATCCKGGCACCTTCAAGTCGGTOCCCWA^ 

VVNKlRGTFKSVAVKAPGf v» 

873/291 

GAc'cGC CGC AAG GCG ATG CTG CAG GAT ATG GCC ATT CTC ACC GOT GOT CAG GTG ATC AGC 
D.RRKAMLQDMAl^LTOGQVi 

GAA^GAG GTC GGC CTG ACG CTG GAG AAC GCC GAc'cTG TCQ CTG CIA GGC AAG GCC CGC AAG 
BEVGLTLENADLSLLGKARK 

GTc'gtG GTC ACC AAG GAC GAG ACC ACC ATC QTC^GAG GGC GCC GGT GAC ACC GAC GCC ATC 
VVVTKDETTIVEGAGDTDAX 

GCC^CGA GTG GCC CAG ATC CGC CAG GAG ATCGAg'aAC AGC GAC TCC GAC TAC GAC CGT 
AGRVAQIRQEIE NSDSDYDR 

GAG^CTG CAG GAG CGG CTG GCC AAG CTG GCc'gGt'ccT GTC GCG GIG ATC AAG GCC GGT 
EKLQBRLAKLAGGVAVIKA 

GCc'gCc'aCC GAG GTC GAA CTC AAG GAG CGC So'clc'cGC ATC GAG GAT GCG GTT CGC AAT 
AATEVELKERKKRIEDAVKi 

1203/401 ™ 3 '±\ 
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KVALEAPLKQIAFNSGLSPG 
1383/461 1413/471 

GTG GTG GCC GAG AAG GTG CGC AAC CTG CCG GCT GGC CAC QGA CTG AAC CCT CAG ACC GOT 
VVAEKVRNLPAGHGLNAQTG 
1443/481 1473/491 

GTC TAC GAG GAT CTG CTC GCT GCC GGC GTT GCT GAC CCG CTC AAG GTO ACC COT TCG GCG 
VYEDLLAAGVADPVKVTRSA 
1503/S01 1533/S11 

CTG CAG AAT GCG GCG TCC ATC GCG GGG CTG TTC CTG ACC ACC GAG GCC GTC GTT GCC GAC 
LQNAAS IAGLFLT T S A V V A D 
1563/521 1593/531 

AAG CCG GAA AAG GAG AAG GCT TCC GTT CCC GGT GGC GGC GAC ATG GOT GGC ATC GAT TTC 
KP 3 K E K A S V PGGG DMGGMDF 
1623/541 1653/551 

cat atg cat gga gat aca cct aca ttg cat gaa cat atg tea gat ttg caa cca gag aca 
HMHGDTPTLHEYHLDLQ. PET 
1683/551 1713/571 

act gat etc tac tgt tat gag caa tea aat gac age tea gag gag gag gac gaa aca gat 
TDLYCYEQLNDSSEEEDEID 
1743/581 1773/591 

ggt cca get gga caa gea gaa ccg gac aga gec cat tac aat att gxa acc etc tgt tgc 
GPAGQAEPDRAHYNIVTFCC 
1803/601 1833/611 

aag tgt gac cct acg ctt egg ttg tgc gta caa age aca cac gta gac att cgt act ttg 
KCDSTLRLCVQSTHVDIRTL 
1363/621 1893/631 

gaa gac ccg tta atg ggc aca eta gga att gtg tgc ccc ate tgt tct cag aaa cca taa 
EDLLMGTLGIVCPICSQKP* 
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EQLNDSSEEEDEXDGFAGQA 
1363/441 1393/451 

GAA CCG GAC AGA GCC CAT TAC AAT ATT GTA ACC TTT TGT TGC AAG TGT GAC TCT ACC CTT 
EPDRAKYNIVTFCCXCD STL 
1423/461 1453/471 

CGG TTG TGC GTA CAA AGC ACA CAC GTA GAC ATT CGT ACT TTG GAA GAC CTG TTA ATG GGC 

RLCVQSTHVQTRTLEDLLMG 

1483/491 1513/491 

ACA CTA GGA ATT GTG TGC CCC ATC TGT TCT CAG AAA CCA TAG 

TLGIVCPICSQKP* 
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ATG GCC CAA ACQ GAA TGG QIC GAA AAA GAC TTC TAC CAG GAG CTG GGC G1C TCC TCT GAT 
MAQREWVSKDFYQELCVSSD 
103/21 , U3/31 

GCC ACT CCX GAA GAG ATC AAA COT GCC TAT CGG AAG TTG GCG CGC GAC CTG CAT CCG GAC 
ASPEEIKRAYRKLARDLHPD 
163/41 193/51 

GCG AAC CCG GGC AAC CCG GCC GCC GGC GAA CGG TTC AAG GCG GTT TCG GAG GCG CAT AAC 
ANFGNPAAGERFKAVSEAHN 
223/61 253/71 

GIG CTG TCG GAT CCG GCC AAG CGC AAG GAG TAC GAC GAA ACC CGC CGC CTG TTC GCC GGC 
VLS DPAKRKEYDETRRLFAG 
283/81 313/91 

GGC GGG TTC GGC GGC CGT CGG TTC GAC AGC GGC TTT GGG GGC GGG TTC GGC GGT TTC GGG 
GGFGGRRFDSGFGGGFGGFG 
343/101 373/111 

GTC GGT GGA GAC GGC GCC GAG TTC AAC CTC AAC GAC TTG TTC GAC GCC GCC AGC CGA ACC 
VGGDGAEFKLNBLFDAASRT 
403/121 433/131 

GGC GGT ACC ACC ATC GGT GAC TTG TTC GGT GGC TTG TTC GGA CGC GGT GGC AGC GCC CGT 
GGTTIGDLFGGLFGRGGSAR 
463/141 493/151 

CCC AGC CGC CCG CGA CGC GGC AAC GAC CTG GAG ACC GAG ACC GAG TTG GAT TTC GTG GAG 
PSRPRRGNDLETETELDFVE 
523/161 553/171 

GCC GCC AAG GGC GTG GCG ATG CCG CTG CGA TTA ACC AGC CCG GCG CCG TGC ACC AAC TGC 
AAKGVAKPLRLTSPAPCTNC 
583/181 613/191 

CAT GGC AGC GGG GCC CGC CCA GGC ACC AGC CCA AAG GTG TGT CCC ACT TGC AAC GGG TCG 
HGSGARPGTSPKVCPTCNGS 

643/201 673/211 

GGC GTG ATC AAC CGC AAT CAG GGC GCG TTC GGC TTC TCC GAG CCG TGC ACC GAC TGC CGA 
GV INRNQGAFGFSEPCTDCR 
703/221 733/231 

GGT AGC GGC TCG ATC ATC GAG CAC CCC TGC GAG GAG TGC AAA GGC ACC GGC GTG ACC ACC 
GSG SI IBHPCEECKGTGVTT 
763/241 793/251 

CGC ACC CGA ACC ATC AAC GTG CGG ATC CCG CCC GGT GTC GAG GAT GGG CAG CGC ATC CGG 
R T R T I NVRIPPGVEDGQRIR 
823/251 353/271 

CTA GCC GGT CAG GGC GAG GCC GGG TTG CGC GGC GCT CCC TCG GGG GAT CTC TAC GTG ACG 
LAG Q G EAGLRGAPSGDLVVT 
833 /2B1 913/291 

GTG CAT CTG CGG CCC GAC AAG ATC TTC GGC CGC GAC GGC GAC GAC CTC ACC GTC ACC GTT 
VKVRPDKIFGROGDDLTVTV 

943/301 973/311 

CCG GTC AGC TTC ACC GAA TTG GCT TTG GGC TCG ACG CTG TCG GTG CCT ACC CTG GAC GGC 
PVSFTELALGSTLSVPTLDG 
1003/321 1033/331 

ACG GTC GGG GTC CGG GTG CCC AAA GGC ACC GCT GAC GGC CGC ATT CTG CGT CTG CGC GGA 
TVGVRVPKGTADGRILRVRG 
1053/341 1093/351 

CGC GGT GTO CCC AAG CGC AGT GGG GGT AGC GGC GAC CTA CTT GTC ACC GTG AAG GTG GCC 
KGVPKRSGGSGDLLVTVKVA 

1123/361 1153/371 

GTG CCG CCC AAT TTG GGA GGC GCC GCT CAG GAA GCT CTG GAA GCC TAT GCC GCG GCG GAG 
VPPNLAGAAQEA^EAYAAAE 
1183/381 1213/391 

CGG TCC AGT GGT TTC AAC CCG CGG GCC GGA TGG GCA GGT AAT CGC ATG CAT GGA GAT ACA 
RS SGFNPRAG WAGNRMHGDT 
1243/401 1273/411 

CCT ACA TTG CAT GAA TAT ATG TTA GAT TTG CAA CCA GAG ACA ACT GAT CTC TAC TOT TAT 
PTLHEYMIiDLQPETTDLYCY 
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ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC AGC GGC CTG CTG CCG CGC GOC AGC CAT 
MGSSHHHHHHSSGLVPRGSH 
53/21 33/31 

ATG get age acg ggc tec acc ggc gca gca age a eg gaa tee cgc etc gat gca ttc aag 

M A S M G S IGAASMEFC F D V ? X 
123/41 153/51 

gag etc aaa gtc cac car gec aac gag aac acc ttc .tac tgc ccc ate gec ate atg tea 
ELKVHHANENIFYCPIAIMS 
183/61 213/71 

get eta gee atg gta tac ccg ggt gca aaa gac age acc agg aca cag at* aac aag gtt 
ALAMVYLGAKDSTRTQINKV 
243/81 273/91 

get cgc ttt gat aaa etc cca gga tec gga gac age act gaa get cag tgt ggc aca tec 
VRFDKLPGFGDSIEAQCGTS 
303/101 333/111 

gta aac gtt cac cct cca ccc aga gac acc ccc aac caa ate acc aaa cca aac gac gec 
VNVKSSLRDILNQITXPNDV 
363/121 393/131 

tat teg ttc age ccc gec age aga etc cac gec gaa gag aga tac cca ate ctg cca gaa 
YSFSLASRLYAEERYPILPE 
423/141 453/151 

tac ttg cag tgt gtg aag gaa ctg tat aga gga ggc ttg gaa ccc acc aac tct caa aca 
YLGCVKELYRGGLEPINFQT 
483/161 513/171 

get gca gat caa gec aga gag etc ate aat tec tgg gta gaa agt cag aca aac gga ett 
AADQARELINSWVB 5QTNCI 
543/181 573/191 

ate aga aat gtc ett cag cca age tee gtg gat tct caa act gca atg get ctg gtt aac 
IRNVLQPSSVDSQTAKVLVN 
603/201 633/211 

gec att gtc ttc aaa gga ctg tgg gag aaa aca ttt aag gat gaa gac aca caa gca atg 
AIVFKGLWEKTFKDEDTQAM 
663/221 693/231 

cct ttc aga gtg aet gag caa gaa age aaa cct gtg cag atg atg tac cag att ggr tta 
PFRVTEQESKPVQMMYQIGI* 
723/241 753/2S1 

ttt aga gtg gca tea atg get tct gag aaa atg aag ate ccg gag cet cca ttt gec agt 
PRVASMASEKMKILiELPFAS 
783/261 813/271 

ggg aca a eg age acg ccg gcg ccg ccg ccc gac gaa gec tea ggc ccc gag cag ccc gag 
GTMSMLVLLPDEVSGLEQLE 
843/281 873/291 

agt ata ate aac etc gaa aaa ccg act gaa tgg acc agb tct aat gtt atg gaa gag agg 
SI INFEKLTEWTSSNVMESR 
903/301 933/311 

aag ate aaa gtg tac tta cct cgc atg aag atg gag gaa aaa tac aac etc aca tct gtc 
KIKVYLPRMXMEEKYNLTSV 
963/321 993/331 

tta acg get acg ggc acc acc gac gcg ccc age cct cca gec aat ccg ccc ggc acc ccc 
LMAMC1TDVFSSSANLSGIS 
1023/341 1053/351 

tea gca gag age ctg aag ata tct caa get gtc cat gca gca cat gca gaa ate aat gaa 
SAESLX I SQAVHAAHAE INE 
1083/361 1113/371 

gca ggc aga gag gtg gta ggg tea gca gag gee gga gcg gat get gca age gtc tec gaa 
AGREVVGSAEAGVDAASVSE 
1143/381 1173/391 

gaa ttt agg get gac cac cca ccc ccc ccc cgc acc aag cac ate gca acc aac gec get 
E .FRADH PFLFCIXHIATNAV 
1203/401 

etc ttc ttt ggc aga tgt get gga tec taa 
LFFGRCVGS* r-is-% 
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acg gCC AGC AGC CAT CAT CAT CAT CAT CAC AGC AGC GGC CTG GTG CCG CGC GGC AGC cat 
MO 5 S HKHHHHS SGLVPRG S H 
63/21 93/31 

atg GCC AAG ACA AIT GCG TAC GAC GAA GAG GCC COT CGC GGC CTC GAG CGG GGC TTG AAC 
MA KT ZAYDEEARRGLERGLN 
123/41 153/51 

GCC CTC GCC GAT GCG GTA AAG GTG ACA TTG GGC CCC AAG GGC CGC AAC GTC GTC CTG GAA 
AL ADAVKVTLG PKGRNVVLE 
133/61 213/71 

AAG AAG TGG GGT GCC CCC ACS ATC ACC AAC GAT GGT GTG TCC ATC GCC AAG GAS ATC GAG 
KKWG A PTITNDGVS IAKEIE 
243/81 273/91 

CTC GAG GAT CCG TAC GAG Aag ate ggc GCC GAG CTG GTC AAA GAG GTA GCC AAG AAG ACC 
LEDPYEXIGAELVKEVAKKT 

303/101 333/111 

GAT GAC GTC GCC GGT GAC GGC ACC ACG ACG GCC ACC GTG CTG GCC CAG GCG TTG GTT CGC 
DDVAG DCTTTATVLAQALVR 
363/121 393/131 

gAg GGC CTG CGC AAC GTC GCG GCC GGC GCC AAC CCG CTC GGT CTC AAA CGC GGC ATC GAA 
EGLRKVAAGANPLGLKRGIE 
423/141 453/151 

AAG GCC GTG GAG AAG GTC ACC GAG ACC CTG CTC AAG GGC GCC AAG GAG GTIC GAG ACC AAG 
KAVEXVTETLLKGAK2VETK 
483/161 513/171 

GAG CAG ATT GCG GCC ACC GCA GCG ATT TCG GCG GGT GAC CAG TCC ATC GGT GAC CTO ATC 
EQ IAATAAISAGDGSIGDLX 
543/181 573/191 

GCC GAG GCG ATG GAC AAG GTG GGC AAC GAG GGC GTC ATC ACC GTC GAG GAG TCC AAC ACC 
AEAMDXVGNEGVITVEESNT 
603/201 633/211 

TTT GGC CTG CAG CTC GAG CTC ACC GAG GGT ATG CGG TIC GAC AAG GGC TAC ATC TCG GGG 
F G LQLBLTEGMRFDKGYISG 
663/221 693/231 

TAC TTC GTG ACC GAC CCG GAG CGT CAG GAG GCG GTC CTG GAG GAC CCC TAC ATC CTG CTG 
YFVTDPERQEAVLEDPY1LL 
723/241 753/251 

OTC AGC TCC AAG GTG TCC ACT GTC AAG GAT CTG CTG CCG CTG CTC GAG AAG GTC ATC GGA 
V S SXVSTVKDLLPLLEKVIG 

783/261 813/271 

GCC GGT AAG CCG CTG CTG ATC ATC GCC GAG GAC GTC GAG CCC GAG GCG CTG TCC ACC CTG 
AGKPLLIIAEDVEGEALSTL 
343/281 B73/291 

GTC GTC AAC AAG ATC CGC GGC ACC TTC AAG TCG GTG GCG GTC AAG GCT CCC GGC TTC GGC 
VVNKXRGTFKSVAVKAPGFG 
903/301 933/311 

GAC CGC CGC AAG GCG ATG CTG CAG GAT ATG GCC ATT CTC ACC GGT GGT CAG GTG ATC AGC 
DKRKAMLQDMAILTGGQVIS 

963/321 993/331 

GAA GAG GTC GGC CTG ACG CTG GAG AAC GCC GAC CTG TCG CTG CTA GGC AAG GCC CGC AAG 
EEVGLTLEKADLSLLGKARK 

1023/341 1053/351 

GTC GTG GTC ACC AAG GAC GAG ACC ACC ATC GTC GAG GGC GCC GGT GAC ACC GAC GCC ATC 
VVVTKDETTIVEGAGDTDAI 

1083/361 1113/371 

GCC GGA CQA GTG GCC CAG ATC CGC CAG GAG ATC GAG AAC AGC GAC TCC GAC TAC GAC CGT 
AGRVAQIRQE1ENSDSDYDR 
1143/3B1 1173/391 

GAG AAG CTG CAG GAG CGC CTG GCC AAG CTG GCC GGT GGT GTC GCG GTG ATC AAG GCC GGT 
EKLQE RLAKLAGGVAVIKAG 

1203/401 1233/411 

GCC GCC ACC GAG GTC GAA CTC AAG GAG CGC AAG CAC CGC ATC GAG GAT GCG GTT CGC AAT 
AATEVELKERKKRIEDAVRN 



FIG. 7A 

SUBSTITUTE SHEET (RULE 26) 



WO 01/04344 



PCT/US0O/18828 



10/37 

1263/421 1293/431 

1323/441 ^53/451° G v T L L Q A 

5^ 5^ A ^ ^*CC AAC ATC GTG 

^^^raM^^raAAGC^ATCCXXT^ 
1443/481 . 1473/491 

1503^ R N ' ? ^533/5U H ° ^ * * Q T G 

GTC TAC GAG GAT CTG CTC GCT GCC GOC CTT GCT GAC CCG GTC AAG GTG ACC CCT cm 

1553/521 1593/531 

CTG CAG AAT GCC GCG TCC ATC GCG GGG CTG TTC CTG ACC ACC GAG GCC GTC GTT GCC Gju- 

?«,?«a, M AASIAGLFL TTE A VVAD 
1623/541 - 1653/551 

AAG CCG GAA AAG GAG AAG GCT TCC CTT CCC GGT GGC GGC GAC ATG GGT GGC ATG GAT TTT? 

1633/561 1713/57X * 

get age ATG ggc tec ace gee gca oca age acg gaa ttt tgt ttt gat ata ttc aaa aaa 
A SMGS IGAASME7CFDVPK ? 
1743/S81 1773/591 

ccc aaa gee cac cat gee aac gag aac ate ttc cac tgc ccc att gee ate acg tea act 
LXVHHANSNIPYCPIAIM^A 
1803/601 1833/611 

eta gee atg gta tac ctg ggt gca aaa gac age ace agtj aca cag ata aac aaa etc etc 

LAMVYLGAKDSTRTQXNK VV 
1863/621 1893/631 

cgc ttt gat aaa ctt cea gga etc gga gac agt ate gaa get cag tgt ggc aca tct gta 

Rr.DXLPGFGDSXEAQCGTSV 
1923/641 1953/651 

aac gtt cac tct tea ctt aga gac ace ccc aac caa ate acc aaa cca aat gat gtt tac 

NVHSSLRDILNQITKPNOVY 
1983/661 2013/671 

teg ttc age ctt gec agt aga ctt tat get gaa gag aga tac cca ate ctg cca gaa tac 
S F S LA S RLYAEERY P XL PBY 
2043/681 2073/691 

ttg cag tgt gCg aag gaa ctg tat aga gga ggc teg gaa ccc ate aac ttt caa aca get 

LQCVKELYRGGLEPINFQTA 
2103/701 2133/711 

gca gac caa gec aga gag ccc acc aat tec tgg gta gaa agt cag aca aat gga att ate 
ADQARELINSWVESQTNGIX 
2163/721 2193/731 

aga aat gtc ctt cag cca age tec gcg gat tct caa acc gca atg gtt ctg gtt aat gee 

RNVLQPSSVDSQTAMVLVNA 
2223/741 2253/751 

att gtc tte aaa gga ccg tgg gag aaa aca ttt aag gat gaa gac aca caa gca atg ccc 

IVFKGLWEKTFKDEDTOAMP 
2283/761 2313/771 

ttc aga gtg act gag caa gaa age aaa cct gcg cag acg acg tac cag att ggt tta tec 

FRVTEQESKPVQMMYQIGLF 
2343/781 2373/791 

aga gtg gca tea atg get cct gag aaa acg aag ate ctg gag ctt eea ett gec agt ggg 

RVASMASSKMKILELPFASG 
2403/801 2433/811 

aca atg age acg Ctg gtg ctg ttg cct gac gaa gec cca ggc ctt gag cag ctt gag agt 

TMSMLVLLPOEVSGLEQLES 
2463/821 2493/831 

ata ate aac tct gaa aaa ceg act gaa tgg acc age ccc aac get atg gaa gag agg aag 
X INFEKLTE MTSSMVMEERK. 
2523/841 2553/851 

ate aaa gcg tac tta ccc cgc acg aag acg gag saa aaa cac aac etc aca tec gec cca 

I KVYLPRMKMEEJCYNLTSVL 
2583/861 2613/871 

atg get acg ggc act act gac gcg ccc age tct tea gec aac ccg tec ggc ate tec cca 
MAMGITDVFSSSANLSGISS 
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2643/881 2673/891 

eea oag age etg aag ata tct caa gee gee cat gca gca cat gca gaa aee aat gaa gca 
gea gag age " x g v K A K H A E I N E A 

2703/901 2733/911 

ggc aga gag gcg 9" COT tea gea gag get gga gcg gat get gea age gee tet gaa gaa 

C R E V V G SAEAGVDAASVSEE 
9763/921 2793/931 

ttt agg get gae eat eea tee etc tte egt ate aag eae ate gea aec aae gee gtt etc 
FR ADHPFLFClKrfIATNAV4- 

2823/941 

tte ttt gge «g* tgt gtt gga tee TAA 

pfgrcvgs* fig> ?c 



1/1 31/11 

acg tec cce ata cca ggc cat tgg aaa ate aag ggc ctt gcg caa ccc act cga ett etc 
H SPtLGYWKlKGLVQPTRi.l> 
61/21 91/31 

ttg gaa eae cet gaa gaa aaa tat gaa gag eae teg tat gag cgc gae gaa ggt gat aaa 

I E YLEEKYEEHLY ERD EGDK 
121/41 151/S1 

egg cga aac aaa aag tec gaa eeg ggt ttg gag ttt ccc aat ett ect tat tat aec gat 

WRNKKFELGLZFPNL PYYID 
181/61 211/71 

ggt gat gtt aaa tta aea ea, tet acg gee ate ata cgc tat ata gee gae aag eae aae 
GDVKLTQSMAI I RY 
241/81 271/91 

atg teg ggt ggt egt eea aaa gag egt gea gag ate tea atg ett gaa gga gcg get ttg 

MLGGCPKEHAElSMLEGAVi. 

301/101 331/111 

gat aee aga tae ggt gtt teg aga ate gca tat age aaa gae ett gaa act etc aaa gtt 

DIRY'GVSRIAJfS KDF ET-> kv 
3S1/121 391/131 

gat tte cet age aag eta ect gaa acg eeg aaa aeg tte gaa gat egt tea tgt eat aaa 
Splskl pemlkmfedrlchk 

421/141 451/151 

aea cat tta aat ggt gat eae gea aec eat ect gae tte aeg ttg tat gae get cet gae 
TYLNGDHVTHPDFMfcYDALD 

481/161 511/171 

get gtt tta tae atg gae cca atg tge etg gat gcg tte eea aaa tea get egt tet aaa 
VVLYMDPMCLDAFPKLVCFK 

541/181 571/191 
• aaa egt att gaa get ate cca caa att gae aag tae ttg aaa tec age aag eat aea gea 
KR1EAX PQIDKYLK S S R " 1 * 
601/201 631/211 

tgg eet ttg cag gge tgg caa gee aeg tet ggt gge ggc gae eat eet eea aaa eeg gae 
w pLQGWQATPGGGDBPPXSU 
SSI/221 691/231 

«g St eeg egt gga tec eea gga att ccc ggg teg aCT CGA GCA cca CCA CCA CCA cca 

LVPRCSFGlPGSTRAPFc'*' 



721/241 
CTG AGA, TCC GGC TGC TAA 
L R S G C * 
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1/1 3i/ii 

atg tec ccc ata cca gge tat tgg aaa act aag ggc etc gtg caa ccc acc cga etc ~tt 

cw.f PILGYWKIK<5: - V ^ PT RLL W 
51/21 91/31 

teg gaa tat etc gaa gaa aaa tat gaa gag cat ttg tat gag cgc gat caa get eat *** 

LEYLEEKVESHLyBRDEGOK 
121/41 151/51 

tgg cga aac aaa aag etc gaa ttg ggc ttg gag ttt ccc aac etc cct tat tat att cac 

wrnkkfelglefpnlfyyt tT 

181/61 211/71 

ggc gac gec aaa tta aca cag tct atg gec ate ata cgc cac aca get gac aag cac aac 

GDVKLTQSMAIIRYI ADK H N 
241/81 271/91 

atg teg ggt ggt tgt cca aaa gag cgc gca gag att tea aeg etc gaa gga gee get tea 
MLGGCPKERAEISMLEG A V L 
301/101 331/111 

gac att aga tac ggc gtc teg aga att gca tat age aaa gac cct gaa act etc aaa gtc 

OIRYGVSRIAYSK DFET^KV 
361/121 391/13X 

gat tec cct age aag cca cct gaa atg ctg aaa atg tec gaa gat cgt tta tgt cat aaa 
DFLSKL PEMLXMFEdrlc'hk 
421/141 451/151 

aca cat cca aac ggt gat cat gta acc cat ccc gac tec atg teg tat gac get ctt gat 

TYLNGDHVTHPDFMLYOALD 
481/161 511/171 

gtt gtc tta tac atg gac cca atg cgc ccg gat gcg ttc cca aaa tta gtt tgt ttt aaa 
VVLYMD PMCLDAF PKLVCFK 
541/181 571/191 

aaa cgt att gaa get ate cca caa att gac aag tac teg aaa tec age aag cat ata gca 

K R I E A I PQIDKYLKSSKY1A 
601/201 631/211 

tgg cct ttg cag ggc tgg caa gec acg tct ggt ggt ggc gac cat cct cca aaa teg gat 
WPLQGWQATF GGGDHPPKSD 
661/221 691/231 

ctg gtt ccg cgt gga tec ATG CAT GGA GAT ACA CCT ACA TTG CAT GAA TAT ATG TTA GAT 

LVFR-GSMHGDTFT&KSYMLD 
721/241 751/251 

TIG CAA CCA GAG ACA ACT GAT CTC TAC TGT TAT GAG CAA TTA AAT GAC AGC TCA GAG GAG 
LQPETTOLYCYEQLNDSSEE 
781/261 811/271 

GAG GAT GAA ATA GAT GGT CCA GCT GGA CAA GCA GAA CCG GAC AGA GCC CAT TAC AAT ATT 
EDE1DG PAGQAEPDRAHYNI 
841/281 871/291 

GTA ACC TTT TCT TGC AAG TGT GAC TCT ACG CTT CGG TTC TGC CTA CAA AGC ACA CAC GTA 
VTFCCKCDSTLRLCVQ STHV 
901/301 931/311 

GAC ATT CGT ACT TTG GAA GAC CTG TTA ATG GGC ACA CTA CGA ATT GTC TGC CCC ATC TGT 

DIRTLEDOLHGTLGIVCPIC 
961/321 

TCT CAG AAA CCA TAA 
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3/1 33/11 

A TG GAT GGA GAT ACA CCT ACA TTG CAT GAA TAT ATG TTA GAT TTG CAA CCA GAG ACA ACT 
MDGDTPTUHEYMLDLQPETT 
S3/21 93/31 

GAT CTC TAC TC7T TAT GAG CAA TTA AAT GAG AGC TCA GAG GAG GAG GAT GAA ATA GAT GGT 
Dt*Y C YEQLNDS SSEEDEIDG 
123/41 i53/51 

CCA GCT GGA CAA GCA GAA CCG GAC AGA GCC CAT TAC AAT ATT GTA ACC TTT TGT TGC AAG 
PAGQAEPDRAHYNIVTFCCK 
183/61 213/71 

TGT GAC TCT ACG CTT CGG TTG TGC GTA CAA AGC ACA CAC GTA GAC ATT CGT ACT TTG GAA 
CDSTLRLCVQS THVDIRTLS 
243/81 273/91 

GAC CTC TTA ATG GGC ACA CTA GGA ATT GTG TGC CCC ATC TGT TCT CAC AAA CCA ACT AGT 
DLLMGTLGIVCPICSQKPTS 
303/101 333/111 

GOT GGC GGT GGC GGC GGA TCC CAC ATG GCC AAG ACA ATT CCG TAC GAC GAA GAG GCC CGT 
GGGGGGSRMAKTIAYDSEAR 
353/121 393/131 

CGC GGC CTC GAG CGG GGC TTG AAC GCC CTC GCC GAT GCG GTA AAG GTG ACA TTG GGC CCC 
RGLERGLNALADAVKVTLGP 

423/141 453/151 

AAG GGC CGC AAC GTC GTC CTG GAA AAG AAG TQG GGT GCC CCC ACG ATC ACC AAC GAT GGT 
KGRNVVLBKKWGAPTITNDG 

483/161 513/171 

GTO TCC ATC GCC AAG GAG ATC GAG CTG GAG GAT CCG TAC GAG AAG ATC GGC GCC GAG CTG 
VSIAKEIELEDPYEXICAEL 
543/181 573/191 

GTC AAA GAG GTA GCC AAG AAG ACC GAT GAC GTC GCC GGT GAC GGC ACC ACG ACG GCC ACC 
VKEVAKKTDDVAGDGTTTAT 

G03/201 S33/211 

GXQ CIO GCC CAG GCG TTG GTT CGC GAG GGC CTG CGC AAC GTC GCG GGC GGC GCC AAC CCQ 
VLAQALVREGLRNVAAGANP 

663/221 693/231 

CTC GGT CTC AAA CGC GGC ATC GAA AAG GCC GTG GAG AAG GTC ACC GAG ACC CTG CTC AAG 
LGLK.RCIBKAVEKVTETLLK 

723/241 753/251 

GGC GCC AAG GAG GTC GAG ACC AAG GAG CAG ATT GCG GCC ACC GCA GCG ATT TCG GCG GGT 
GAKEVETKEQIAATAAISAG 
763/261 813/271 

GAC CftG TCC ATC GGT GAC CTG ATC GCC GAG GCG ATG CAC AAG GTG GGC AAC GAG GGC CTC 
D OS IGDLIAEAMDKVGNEGV 

843/281 873/291 

ATC ACC GTC GAG GAG TCC AAC ACC TTT GGG CTG CAC CTC GAG CTC ACC GAG GGT ATG CGG 
TTVEBSNTFGLQLELTEG MR 

903/301 933/311 

TTC GAC AAG GGC TAC ATC TCG GGG TAC TTC GTG ACC GAC CCG GAG CGT CAG GAG GCG GTC 
5.DXGYISGVFVTDPERQEAV 

963/321 993/331 

CTG GAG GAC CCC TAC ATC CTG CTG QTC AGC TCC AAG GTG TCC ACT GTC AAG GAT CTG CTG 
LED? y IL LVSSXVSTVKDLL 

1023/341 1053/351 

CCG CTG CTC GAG AAG GTC ATC GGA GCC GCT AAG CCG CTG CTG ATC ATC GCC GAG GAC GTC 
PLLEKVIGAGKPLLIIASDV 

1083/361 1113/371 

GAG GGC GAG GCG CTG TCC ACC CTG GTC CTC AAC AAG ATC CGC GGC ACC TTC AAG TCG GTG 
BQEALSTLVVHKrRGTFKSV 

1143/381 1173/391 

GCG GTC AAG OCT CCC GGC TTC GGC GAC CGC CGC AAG GCG ATG CTG CAG GAT ATG GCC ATT 
XVKAPG FGDRRKAMLQDMAI 

1503/401 - 1233/411 

CTC ACC GGT GGT CAG OTG ATC AGC GAA GAG QTC GGC CTG ACG CTG GAG AAC GCC GAC CTG 
LTGGQVISEEVGLTLENAPI- 
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1263/421 1293/431 

TOG CTG CT A GGC AAG GCC CGC AAG CTC GTG GTC ACC AAG GAC GAG ACC ACC ATC GTC GAG 

SLLGXARKVVVTKDETTIV^ 
1323/441 1353/451 

GGC GCC GGT GAC ACC GAC GCC ATC GCC GGA CGA GTG GCC CAG ATC CGC CAG GAG ATC GAG 
GAGDTDA1AGRVAQIKQEI E 
1383/461 1413/471 

AAC AGC GAC TCC GAC TAC GAC CGT GAG AAG CTG CAG GAG CGG CTG GCC AAG CTG GCC GGT 

NSUSDYDREKLQERLAKLAG 
1443/481 1473/491 

GGT GTC GCG CTG ATC AAG GCC GGT GCC GCC ACC GAG GTC GAA CTC AAG GAG CGC AAG CAC 
GVAVIXAGAATSVELKERKH 
1503/501 1533/511 

CGC ATC GAG GAT GCG GTT CGC AAT GCC AAG GCC GCC GTC GAG GAG GGC ATC GTC GCC GGT 
RIEDAVRNAKAAVEEGIVAG 
1553/521 1593/531 

GGG GGT GTG ACG CTG TTG CAA GCG GCC CCG ACC CTG GAC GAG CTG AAG CTC GAA GGC GAC 
GGVTLLQAAPTLDELKLEGD 
1623/541 1653/551 

GAG GCG ACC GGC GCC AAC ATC GTG AAG GTG GCG CTG GAG GCC CCG CTG AAG CAG ATC GCC 
EATGANIVKVALEAPLKgi A 
1583/561 1713/571 

TIC AAC TCC GGG CTG GAG CCG GGC GTG GTG GCC GAG AAG CTG CGC AAC CT3 CCG GCT GGC 
FNS G LEPCVVAEKVRNLPAG 
1743/58! 1773/591 

CAC GGA CTG AAC GCT CAG ACC GGT GTC TAC GAG GAT CTG CTC GCT GCC GGC GTT GCT GAC 
H C I# N A Q TGVYG D L L A A G V A D 
1803/601 1833/611 

CCG GTC AAG GTG AOC OCT TCG GCG CTG CAG AAT GCG GCG TCC ATC GCG GGG CTG TIC CTG 
PVKVTRSALQNAASIAGLF L 
1863/621 1893/631 

ACC ACC GAG GCC G1C GTT GCC GAC AAG CCG GAA AAG GAG AAG GCT TCC GTT CCC GGT GGC 

TTEAVVADXPEKEKASVPGG 

1923/641 

GOC GAC ATG GGT GGC ATG GAT TTC TGA 

GDMGGHDF* j-jq ^ 
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3/1 33/11 

ATG GCC AAG ACA ATT GCG TAC GAC GAA GAG GCC CGT CGC GGC CTC GAG COG GGC TTG AAC 

63/21 93/31 u 

GCCreGCCGATGCGCHAAAGGTCACATTGCXXCCC^ 

123/41 153/51 * » V V L E 

AAG AAG TGG GGT GCC CCC ACG ATC ACC AAC GAT GGT GTG TCC ATC GCC AAG GAG ATT rjir 

KKWGAPTITNDGVSIAket p 
1B3/61 213/71 * E 1 E 

CTG GAG GAT CCG TAC GAG AAG ATC GGC GCC GAG CTC GTC AAA GAG GTA CCC AAG AAG ACC 

^,,L DPYEKrGAELVKEVAK KT 
243/81 273/91 

GAT GAC GTC GCC GGT GAC GGC ACC ACG ACG GCC ACC GTG CTG GCC CAG GCG TTG GTT CGC 

DDVAGDGTTTATVLAQALVR 
303/101 333/111 

GAG GGC CTG CGC AAC GTC GCG GCC GGC GCC AAC CCG CTC GGT CTC AAA CGC GGC ATC oaa 

EGLRNVAAGANPLGLKRGI E 
363/121 393/131 

AAG GCC GTO GAG AAG GTC ACC GAG ACC CTG CTC AAG GGC GCC AAG GAG GTC GAG ACC AAG 

KAVEKVTSTLLKGAKEVETK 
423/141 453/151 * 

GAG CAG ATT GOG GCC ACC GCA GCG ATT TCG GOG GGT GAC CAS TCC ATC GGT GAC CTG ATC 

EQ1AATAAISAGDQSIGDLX 
483/161 513/171 

GCC GAG GCG ATG GAC AAG GTG GGC AAC GAG GGC GTC ATC ACC GTC GAG GAG TCC AAC ACC 

AEAMDKVGNEGVITVEESNT 
543/181 573/191 

TTT GGC CTG CAG CTC GAG CTC ACC GAC GGT ATG CGG TTC GAC AAG GGC CAT ATG CAT OGA 

FGLQLELTEGMRFDKGKMHG 
603/201 633/211 

GAT ACA OCT ACA TTG CAT GAA TAT ATG TTA GAT TTG CAA CCA GAG ACA ACT GAT CTC TAC 

DTPTLHEYMLOLQPETTDLY 
663/221 693/231 

TGT TAT GAG CAA TTA AAT GAC AGC TCA GAG GAG GAG GAT GAA ATA GAT GGT CCA GCT GGA 

CYEQLi/DSSEEEDElDGPAG 
723/241 7S3/251 

CAA GCA GAA CCG GAC AGA GCC CAT TAC AAT ATT GTA ACC TTT TGT TGC AAG TGT GAC TCT 

QAEPDRABYNIVTFCCKCDS 
733/261 813/271 

ACG CTT COG TTG TGC GTA CAA ACC ACA CAC GTA GAC ATT CGT ACT TTG GAA GAC CTG TTA 

TLRLCVQST .HVDlRTLEDLL 
843/281 873/291 

ATG GGC ACA CTA GGA ATT GTG TGC CCC ATC TGT TCT CAG AAA CCA TAA 
MGTLGZVCPICSQXP* 
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108/1 _ mmr- rv-ji CTC GAG ScMG ATT CTC GTG CAG GCC AAC GAS GCC 



1° 8/1 _ _ ^ AAG CCA CIC GAG GAC AAtt A.i w~ ^ ~- 

ATG GCG AAG GTG AAC ^TC AW* ^ ^ p Q R x L v q a N E A 

M AK vNl ~ 198/31 

B T T T A S G L V j58/si 

238/41 ^ ^ m « « CM WS <M CM G« CK AM Wl COB W «8 

gcgaccgtcgttgccgtccwcctggcc s d _ 2 K R - p 

238/61 „ _,_ . cc ~c ATC TA C AGC AM TAC CTC G=C ACC GAG ATC AAG 

CT3 GAC OTT GCG GAG GST GAC ACC CTC A«- - K v C 8 T E X K 

L DVAS cDT ' 378/9L 

348/81 „_ . —tt. »«-r; ecA CGC GAC GTG CTQ GCC GTC GTT TCC MO 

TAC AAC GGC GAG GAA TAC CTG ATC CTG TCG GCA COC ^ ^ ^ A v v S K 

408/101 ..^^TTCSICa TAt'aTC TTA GAT TTG CAA CCA GAG ACA ACT 

ATG CAT GGA GAT ACA CCT ACA TTG CnX GAA T/u ^ ^ Q L Q ? E T T 

*68/121 _ M . GAC AGC TCA GAG GAG GAG GAT GAA ATA GAT GOT 

GAT CTC TAC TGT TAT GAG CAA TTA AAT GAC AGC TCA ^ £ ^ q E 1 D G 

O L V C ^ S Q 558/151 

528/141 GCC CAT TAC AAT ATT GTA ACC TTT TGT TGC AAG 

CCA GCT GGA CAA CCA GAA CCG G-C AGA GCC W ^ N r V T P C C K 

P A G Q A 618/171 



583/161 IOC GTA CAA AGC ACA CAC GTA GAC ATT CGT ACT TTG 

<TGT GAC TCT ACG CTT CGG TTG TGC OTA v D I R T L E 
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3/1 K' 11 

atg gat gga gat aca cct aca ctg cac gaa cat arg tta gat ttg caa cca gag aca act 



MDGDTPTLH 
63/21 53/31 



gat etc tac tgt tat gag caa tta aat gac age tea gag gag gag gat gaa ata gat ggc 
DLYCYEQLND95EEEDSIDG 

123/41 153/51 

cca get gga caa gca gaa ccg gac aga gee cat tac aat att gta acc ttt tgt tgc aag 
PAGQAEPDRAHYNIVTFCCK 

183/61 213/71 

cat gac tct acg ctt egg teg tgc gta caa age aca cac gta gac ate cgt act ttg gaa 
C DSTLR&CVQSTHVDXRTLE 

273/91 

gae'etg tta atg ggc aca eta gga act gcg tgc ccc ate tgt tct cag aaa cca gec atg 
DLLMGTLGIVCPICSOKPAM 

303/101 333/111 

gCT CGT GOG GTC GGG ATC GAC CTC GGC ACC ACC AAC TCC GTC GTC TCG GTT CTG GAA GGT 
A R A V GIOLGTTNSVVSVLEG 
363/121 393/131 

MC GAC CCG GTC GTC C7TC GCC AAC TCC GAG GGC TCC AGG ACC ACC CCG TCA ATT C7TC GCG 
CDPVVVAM S EG S RTTP S I VA 
423/141 453/151 

TIC GCC CGC AAC GGT GAG GTG CTG GTC GGC CAG CCC GCC AAG AAC CAG GCG CTG ACC AAC 
FARN GBVLVGQPAKNQAVTN 

513/171 

CTc'gAT CGC ACC GTG CGC TCG GTC AAG CGA CAC ATC GGC AGC GAC TGG TCC ATA GAG ATT 
V 0 RTVRSVKRHMGSDWSIEI 
543/181 573/191 

GAC GGC AAG AAA TAC ACC GCG CCG OAG ATC AGC GCC CGC ATT CTO ATG AAG CTG AAG 

KKyTA pElSARILHKLKR 

fim/7ni 633/211 

GAC GCC GAG GCC TAC CTC GGT GAG GAC ATT ACC GAC GCG GTT ATC ACG ACG CCC GCC TAC 
DAEAYI.OEDITDAVITTPAY 

693/231 

TTC AAT GAC GCC CAG CGT CAG GCC ACC AAG GAC GCC GGC CAG ATC GCC GGC CTC AAC GTC 
F 5 I A Q R Q A T K Q I A G L N V 

mi/DM 753/251 

cS cn ATC GTC AAC GAG CCG ACC GCG GCC GOG Ctg gee TAC GGC CTC GAC AAG GGC GAG 
T^RlVNEPTAAALAYGLDKGt, 

7A1/261 813/271 

AAC GAG CAG CGA ATC CTG GTC TTC GAC TTG GGT GGT GGC ACT TTC GAC GTX TCC CTG CTG 
Q R x L V r D L G^G T F D V S L L 

^aS GGC GAG GGT GTG GTT GAG GTC CGT GCC ACT TCG GGT GAC AAC CAC CTC GGC GGC 
EI GE oVVEVRATSGDNHLGG 

GAc'gAC TGG GAC CAG CGG GTC GTC GAT TGG CTG GTG GAC AAG TTC AAG GGC ACC AGC GGC 
DDWDQRVVOWLV OKPKGTSG 

qm/W 993/331 

ATC GAT CTG ACC AAG GAC AAG ATG GCG ATG CAG CGG CTG CGG GAA GCC GCC GAG AAG GCA 
IDLTKDKMAMQRLREAAEXA 
ift^/^ai 1053/351 

AAG ATC GAG CTG AOT TCG AGT CAG TCC ACC TCG ATC AAC CTG CCC TAC ATC ACC GTC CAC 
KIELSSSQSTSINL PYITVU 

1113/371 

GCC GAC AAG AAC COG TTG TTC TTA GAC GAG CAG CTG ACC CGC GCG GAG TTC CAA CGG ATC 
ADKNP L7Lt>EQLTRAEFQRl 
lldl/lfll 1173/391 

ACT CAG GAC CTG CTG GAC CGC ACT CGC AAG CCG TTC CAG TCG GTG ATC GCT GAC ACC GGC 
T QDLLORTRKPFQSVIAPT^ 

:- n ., in1 . 1233/411 

ATT TCG GTG TCG GAG ATC GAT CAC GTT GTG CTC CTG GGT GGT TCG ACC CGG ATG CCC GCG 
XSVSEIDHVVLVGGSTRMPA 
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1263/421 1293/431 

GTC ACC GAT CTG QTC AAG CAA CTC ACC GGC GGC AAG CAA CCC AAC AAC CCC GTC AAC CCC 
VTDLVKELTGGKEPNKGVN? 
1323/441 1353/451 

GAT GAG CIT GTC GCG CTC CCA GCC GCT CTG CAG GCC GGC CTC CTC AAG GGC GAG GTG AAA 

D EVVAVGAAL»QAGVLKGEVK 
1383/461 1413/471 

GAC GTT CTG CTG CTT GAT GTT ACC CCG CTG ACC CTG GGT ATC GAG ACC AAG GGC GOG GTG 
DVLLI»DVT PL S LGI ETKGGV 
1443/481 1473/491 

ATG ACC AGG CTC ATC GAG CGC AAC ACC ACG ATC CCC ACC AAG CGG TCG GAG ACT TTC ACC 
MTRLI ERNTTI PTKRSETFT 
1503/501 1S33/511 

ACC GCC GAC GAC AAC CAA CCG TCG CTG CAG ATC CAG GTC TAT CAG GGG GAG CGT GAG ATC 
TADDNQPSVQ I QVY QGERE I 
1563/521 1593/531 

GCC GCG CAC AAC AAG TTG CTC GGC TCC TTC GAG CTG ACC GGC ATC CCG CCG GCG CCG CGG 
AAHNKLLGSFELTGIPPAPR 
1623/541 1653/551 

CGG ATT CCG CAG ATC GAG GTC ACT TTC GAC ATC GAC GCC AAC GCC ATT GTG CAC GTC ACC 
GX P Q IEVTFD IDANGXVHVT 
1683/561 1713/571 

GCC AAG GAC AAG GGC ACC GGC AAG GAG AAC ACG ATC CGA ATC CAG GAA GGC TCG GGC CTG 
AKDKGTGXENTIRIQEG SGt* 
1743/581 1773/591 

TCC AAG GAA GAC ATT GAC CGC ATG ATC AA& GAC GCC GAA GCG CAC GCC GAG GAG GAT CGC 
SKEDIDRMIKDAEAHAEEDR 
1803/601 1833/611 

AAG CGT CGC GAG GAG GCC GAT GTT CGT AAT CAA GCC GAG ACA TTG GTC TAG CAG ACG GAG 
KRREEADVRNQAETI/VYQTE 
1863/621 1893/631 

AAG TTC GTC AAA GAA CAG CGT GAG GCC GAG GOT GGT TCG AAG GTA CCT GAA GAC ACG CTG 
KFVKEQREAEGGSKVPEDTL 
1923/641 1953/S51 . 

AAC AAG GTT GAT QCC GCG GTG GCG GAA GCg AAG GCG GCA CTT GGC GGA TCG GAT ATT TCG 
NKVDAAVAEAKAALGGSJDIS 
1983/661 2013/571 

GCC ATC AAG TCC GCG ATG GAG AAG CTG GGC CAG GAG TCG CAG GCT CTG GGG CAA GCG ATC 
A1KSAMEKLGQESQALCQAI 
2043/581 2073/691 

TAC GAA GCA GCT CAG GCT GCG TCA CAG GCC ACT GGC GCT GCC CAC CCC GGC GGC GAG CCG 
YEAAQAASQATCAAHPGGEP 

2103/701 2133/711 
GGC GGT GCCCACCCCCXXTCGGCTGAGCTAGCATGA 
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atg gac gga gat aea cct aca ttg cat gaa tac acg tta gat ttg caa cca gag aca 

MDGDTPTLHEYHLDLQPET 
63/21 93/31 

gat etc tac tgt tat gag caa tta aat gac age tea gag gag gag gat gaa ata gat get 

OLYCYEQLNDSSEEEDETDG 
123/41 153/51 

cca get gga caa gca gaa ccg gac aga gec cat tac aat att gta acc ttc tgt tgc aag 

PAGQAEPDRAHYNIVTFCCK 
183/61 213/71 

tgt gac tct ncg ctt egg ttg tgc gta caa age aea cac gta gac att cgt act ttg gaa 

CDSTLRLCVQSTKVDIRTLE 
243/81 273/91 

gac ccg tta atg ggc aca eta gga att gtg tgc ccc ate tgt. tct cag aaa cca gec atg 
DLLMCTLGIVCPICSQKPAM 
303/101 333/111 

gCT CGT GCG GTC QGG ATC GAC CTC GGG ACC ACC AAC TCC CTC GTC TCG GTT CTG GAA GGT 

ARAVGIDLGTTNSVVSVLEG 
363/121 393/131 

GGC GAC CCG GTC CTC GTC GCC AAC TCC GAG GGC TCC AGG ACC ACC CCG TCA ATT GTC GOG 
GDPVVVANSEGSRTTPSIVA 
423/141 453/1S1 

TTC GCC CGC AAC GGT GAG GTG COT GTC GGC CAG CCC GCC AAG AAC CAG GCG GTG ACC AAC 
PARNG EVLVGQPAKNQAVTN 
4B3/161 513/171 

GTC GAT CGC ACC GTG CGC TCC GTC AAG CGA CAC ATG GGC AGC GAC TCG TCC ATA GAG ATT 
VDRTVRSVKRHMGSDWSIEI 
543/181 573/191 

GAC GGC AAG AAA TAC ACC 005 CCG GAG ATC AGC GCC CGC ATT CTG ATG AAG CIO AAG COC 
DGKKYTAPEISARILKKLKR 
603/201 633/211 

QAC GCC GAG GCC TAC CTC GGT GAG GAC ATT ACC GAC GCG GTT ATC ACG ACG CCC GCC TAC 
DAEAYLGEDITDAVITTPAY 
663/221 693/231 

TTC AAT GAC GCC CAG CGT CAG GCC ACC AAG GAC GCC GGC CAG ATC GCC GGC CTC AAC GTG 
FNDAQRQATKDAGQIAGONV 
723/241 753/251 

CTG CGG ATC GTG AAC GAG CCG ACC GCG GCC GCG ctg gec TAC GGC CTC GAC AAG GGC GAG 
LRIVNEPTAAALAYGLDKGE 
783/261 813/271 

AAG CAG CAC CGA ATC CTC CTC TTC CAC TTG OCT GGT GGC ACT TTC GAC GTT TCC CTG CTG 
KEQR I LVFDLGGCTFDVSLL 
843/281 873/291 

GAG ATC GGC GAG GOT GTG GTT GAG GTC CGT GCC ACT TCG GGT GAC AAC CAC CTC GGC GGC 
EIGECVVJ5VRATSGDNKLGG 
903/301 933/311 

GAC GAC TOG GAC CAG CGG CTC GTC GAT TQG CTG GTC GAC AAG TTC AAG GGC ACC AGC GGC 
DDWDQRVVDWLVDKFKGTSG 
963/321 993/331 

ATC GAT CTG ACC AAG GAC AAG ATG GCG ATG CAG CGG CTG CGG GAA GCC GCC GAG AAG GCA 
TDLTXDKMAMQRLREAAEKA 
1023/341 1053/351 

AAG ATC GAG CTC AGT TCG AGT CAG TCC ACC TCG ATC AAC CTG CCC TAC ATC ACC GTC GAC 
Kl EL SSSQ5TSINLPYZTVD 
1083/361 1113/371 

GCC GAC AAG AAC CCG TTG TTC TTA GAC GAG CAG CTG ACC CGC GCG GAG TTC CAA CGG ATC 
ADXNPLFLDEQLTRAEFQHI 
1143/381 1173/391 

ACT CAG GAC CTG CTG GAC CGC ACT CGC AAG COG TTC CAG TCG GTG ATC GOT GAC ACC GGC 
TQDL LDRTRKPFQSVIADTG 
1203/401 1233/411 

ATT TCG GTG TCG GAG ATC GAT CAC GTT GTG CTC CTG CCT CGT TCG ACC CGG ATG CCC GCG 
ISVS BIDKVVLVGGSTRMPA 
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GTG ACC GAT CTG GTC AAG GAA CTC ACC GGC GGC AAG cii /vr 

V T D L V K S L TGGK ^f ^ 0 ^^^^^ 
1323/441 1353/451 P * * G V * p 

GAT GAG GTT GTC GCG GTG GGA GCC GCT CTG CAC GCC GGC CTC err j»n 

1383/461 J, 13 * 7l « L K G E v 



GAC GTT CTC CTG CTT GAT GTT ACC CCC CTG AGC CTG GGT ATC r\r arr 

1443/481 1473/491 1 £ ? * C C V 

ATG ACC AGS CTC ATC GAG CGC AAC ACC ACG ATC CCC ACC AAC rrv* tw~ ~ 
?S03TS0X R ^ 8 R N T T fj £f 

1563/521 U r S93/531 ° E H E 1 

GCC GCC CAC AAC AAG TTG CTC GGG TCC TTC GAG CTC ACC CGC VTT rr-r- /-^- 

A A H N.K L L G S F E l t ct ^ CCCCGCCCCQOGG 
1633/341 1653/5S1 ' ^ 

^MCAfiJTC^^^^^^^ ^^^^^ 

1683/561 1713/571 I V H V T 

1743/581 [iTl/Ml 

1803/601 1833/611 E E D R 

AAQ CCT COC GAG GAG GCC GAT CTT CGT AM? CAA act? cap \r-x - _ 

K R R S B A D V R N ^^^^^^f^f 0 ^ 5 
il^ 3 ^ 1 ^ ? B3 3/631 L V Y Q T E 

-SU43/681 2073/591 

2103/701 * v * ri ^«PGCEP 

2163/721 ViJ AEVVDDG 
OX5 GAG GCC AAC TGA 

R E A K * FIG. 14B 
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acg gCA AAA GAA ATT AAA TTT tca tca gat ccc cgt tca gct atg gtc cgt got gtc gat 

MAKEIKFS SOARSAMVRGVD 
53/21 53/31 ' 

ATC CTT GCA GAT ACT GTT AAA GTA ACT TTG GGA CCA AAA GGT CGC AAT GTC GTT CTT CAA 

ILADTVKVTLGPXGRNVVLB 
123/41 153/51 

AAG TCA TTC GGT TCA CCC TTG ATT ACC AAT GAC GGT GTG ACT ATT GCC AAA GAA AIT CAA 
KSFGSPLITNOGVTIAKEI E 
183/61 213/71 

TTA GAA GAC CAT TTT GAA AAT ATG GGT GCC AAA TTG GTA TCA GAA GTA GCT TCA AAA ACC 
L20RFSNM0AXLVSBVA5XT 
243/81 273/91 

AAT GAT ATC GCA GGT GAT GGA ACT ACA ACT OCA ACT GTT TTG ACC CAA GCA ATC GTC CGT 
N D I A G DGT T TAT VLTQA I V R 
303/101 333/111 

GAA GGA ATC AAA AAC GTC ACA GCA GGT GCA AAT CCA ATC GGT ATT CGT CGT OGG ATT GAA 
SGIXNVTAGANPIGIRRGI E 
363/121 393/131 

ACA GCA GTT GCC GCA GCA GTT GAA GCT TTG AAA AAC AAC GTC ATC CCT GTT GCC AAT AAA 
TAVAAAVEALKNNVIPVANK 
423/141 453/151 

GAA GCT ATC GCT CAA GTT GCA GCC GTA TCT TCT CGT TCT GAA AAA GTT GGT GAG TAC ATC 
EAIAQVAAVSSRSEKVCEY I 
433/161 513/171 

TCT GAA GCA ATG GAA AAA GTT GGC AAA GAC CGT GTC ATC ACC ATC GAA GAG TCA CGT GGT 
SEAWEKVGKDGVITISESRG 
S43/181 573/191 

ATG GAA ACA GAG CTT GAA GTC GTA GAA GGA ATG CAG TTT GAC CGT GGT TAC CTT TCA CAG 
METELEVVEGMQFDRGYl* S Q 
603/201 633/211 

TAC ATG GTG ACA GAT AGC GAA AAA ATG GTG GCT GAC CTT GAA AAT CCG TAC ATT TTG ATT 
YMVTDSEKMVADLENPYI h 1 
S63/221 693/231 

ACA GAC AAG AAA ATT TCC AAT ATC CAA GAA ATC TTG CCA CTT TTG GAA AGC ATT CTC CAA 
TDKKXSNIQEILPLLESI L Q 
723/241 753/251 

ACC AAT CGT CCA CTC TTG ATT ATT GCG GAT GAT GTG GAT GGT GAG GCT CTT CCA ACT CTT 
S N R P L L I IADDVDGEAl/PT L 
733/261 813/271 

GTT TTG AAC AAG ATT CGT GGA ACC TTC AAC GTA OTA GCA QIC AAG GCA CCT GGT TTT GGT 
VLNKIRGTFNVVAVKAPGFG 
843/281 873/291 

GAC CGT CGC AAA GCC ATG CTT GAA GAT ATC GCC ATC TTA ACA GGC GGA ACA GTT ATC ACA 
DRRKAMLEDIAILTGGTVIT 
903/301 933/311 

GAA GAC CTT GGT CTT GAG TTG AAA GAT GCG ACA ATT GAA GCT CTT GGT CAA GCA GCG AGA 
EDLGLELKDATIEALGQAAR 
963/321 993/331 

GTG ACC GTG GAC AAA GAT AGC ACC GTT ATT GTA GAA GGT GCA GGA AAT CCT GAA GCG ATT 
VTVDKDSTVIVEGAGNPEAI 
1023/341 1053/351 

TCT CAC CGT GTT GCC GTT ATC AAG TCT CAA ATC GAA ACT ACA ACT TCT GAA TTT GAC CGT 
SHRVAVIKSQIZTT TSE FDR 
1083/361 1113/371 

GAA AAA TTG CAA GAA CGC TTG GCC AAA TTG TCA GGT GGT GTA GCG GTT ATT AAG GTC GGA 
EKLQERLAKLSGGVAVIKVG 
1143/381 1173/391 

GCC GCA ACT GAA ACT GAG TTG AAA GAA ATG AAA CTC CGC ATT GAA GAT GCC CTC AAC GCT 
AATETELKEMKLR1EDALNA 
1203/401 1233/411 

ACT CGT GCA GCT GTT GAA GAA GGT ATT- GTT CCA GCT GGT CGA ACA GCT CTT GCC AAT GTG 
TRAAVEEGIVAGGGTALANV 
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AST CCA GCT GTT GCT ACC TTC GAA TTG ACA OCA CAT GAA GCA ACA GGA COT AAT AT? GTT 

IPAVATLELTGDEATGRNIV 
1323/441 1353/451 

CTC CQT GCT TTG GAA GAA CCT GTT CGT CAA AIT OCT CAC AAT OCA GGA TTT GAA GGA TCT 

URALEEPVRQIAHNAGPEGS 
1393/461 1413/471 

ATC GTT ATC GAT COT TT3 AAA AAT GCT GAG CTT CCT ATA GGA TTC AAC GCA GCA ACT GGC 
I V I DRLKNAELGIGFNAATG 
1443/481 1473/491 

GAG TOG GTT AAC ATC ATT CAT CAA GOT ATC ATT GAT CCA GTT AAA GTG AGT CGT TCA GCC 

EWVNMIDCGIIDPVKVSRSA 
1503/501 1533/511 

CTA CAA AAT GCA OCA TCT CTA GCC AGC TTG ATT TTC ACA ACA GAA GCA GTC GTA GCC AAT 
LQNAASVASLI LTTBAVVAN 
1563/521 1593/531 

AAA CCA GAA CCA GTA GCC CCA GCT CCA GCA ATG GAT CCA AGT ATG ATG GGT GGA ATG GGC 

KPEPVAPAPAHDPSMMGGMG 
1623/541 1653/551 

GGA GCT AGC atg cat gga gat aca ccc aca ccg cac gaa tat atg tta gat ttg caa cca 
GASMHGDTPTLHEYMLOLQP 
1683/561 1713/571 

gag aca act gac etc tac cgc tat gag caa tta aat gac age tea gag gag gag- gat gaa 

ETTDLYCYEQLNDSSEEEDE 
1743/581 1773/591 

ata gat gc/t cca get gga caa gca gaa ccg gac aga gee cat tac aat att gta ace ttt 
IDGPAGQAEPDRAHYNIVTP 
1803/501 1833/611 

tgt tgc aag tgt gac cct acg ctt egg ttg tgc gca caa age aca cac gca gac att egt 

CCJCCDSTLRLCVQSTHVDIR 
1863/621 1B93/631 

act ttg gaa gac ctg tea atg ggc aca eta gga act gtg tgc ccc ate tgt tct cag aaa 

TLEDLLMGTLGIVCFICSQK 

1923/641 

* ca ™ FIG. 15B 



SUBSTITUTE SHEET (RULE 26) 



WO 01/04344 



PCT/US00/18828 



23/37 

4/1 34/11 

ATC AAA GAG CTC AAG TTC GGT GTC GAA GCC CCT GCT CAG CTC CTC AAG GGT GTT GAC ACT 
MXSLKFGVSA^AQLLKGVDT 
64/21 94/31 

CTG GCC AAG GCC GIG ACT TCG ACT CTT GGT CCT AAG GCT CGT AAC GTC CTT ATC GAG TCT 
LAKAVTSTLGPKGRNVLIES 
124/41 154/51 

CCC TAT GGC TCC CCT AAG ATC ACC AAG GAT GGT CTC TCT GTT GCC AAG GCC ATC ACT CTC 
PYGSPKITKDGVS VAKAITL 
184/61 214/71 

CAA GAC AAG TTC GAG AAC CTC GGT GCT CGC CTC CTC CAG GAT GTC GCT TCT AAG ACC AAC 
QOKFSNLGARLLQDVASKTN 
244/81 274/91 

GAG ATT GCT GGT GAC GGT ACC ACC ACC CCT ACC GTC CTT GCC CGT GCC ATC TTC TCT GAG 
EIAGDGTTTATVLARAIPSE 
304/101 334/111 

ACC GTG AAG AAT GTT GCT GCT GGC TGC AAC CCC ATG GAT CTG CGC CGC CGT ATC CAG GCT 
TVKNVAAGCNPMDLRRGI Q A 
364/121 394/131 

GCT GTT GAT CCT GTC GTC GAC TAC CTC CAG AAG AAC AAG CGT GAC ATC ACC ACC GGT GAG 
AVDAVVDYLQKN3CRDITTG B 
424/141 454/151 

GAG ATC GCT CAG GTT GCT ACT ATC TCC GCT AAC GGT GAC ACC CAC ATT GGT AAG CTO ATC 
ElAQVATISANGDTHIGKIil 
484/161 514/171 

TCC ACC GCC ATG CAG CGT GTT GGC AAG GAG GGT GTC ATC ACT GTC AAG GAG CCC AAC ACC 
STAMERVGKEGVITVKEGKT 
544/181 574/191 

ATT GAG GAT GAG CTC GAG GTC ACT GAG GOT ATG CGC TTC GAC CGT GGA TAC ACC TCC CCC 
XZDBLEVTBGMRPORGYTS P 
604/201 634/211 

TAC TTC ATC ACC GAT ACC AAG TCC CAG AAG GTT GAG TTC GAG AAG CCT CTG ATT CTG CTG 
ypiTDTKSQKVEFSX PLILL 
664/221 694/231 

TCT GAG AAG AAG ATC TCT GCC GTT CAG GAC ATC ATC CCC GCC CTT GAG GCC TCC ACC ACC 
SEKKI SAVQDI I PAL EASTT 
724/241 754/251 

CTC CGC CGC CCC CTG GTT ATT ATC GGA GAG GAC ATT GAG GGT GAG GCT CTC GCC GTC TGC 
LRRPLVIIAEDXBGZALAVC 
784/261 814/271 

ATT CTG AAC AAG CTT CGT GGC CAG CTG CAG GTC GCT GCT GTC AAG GCT CCT GGA TTC GGT 
ILNKLRGQLQVAAVKAPGFG 
344/281 874/291 

GAC AAC CGC AAG AGC ATC CTG QGC GAT CTT GCC GTC CTT ACC AAC GGT ACC GTC TTC ACT 
DNRKSILGDLAVLTNGTVFT 
904/301 934/311 

GAT GAG CTC GAC ATC AAA CTC GAG AAG CTT ACC CCC GAT ATG CTT GGT TCC ACC GGC GCC 
OELDIKLEKLTPDKLGSTGA 

964/321 994/331 

ATC ACC ATC ACC AAG GAG GAC ACC ATC ATC CTG AAC GGG GAG GGC AGC AAG GAC GCC ATT 
ITITKEOTIILNGEQSKDAI 
1024/341 -1054/351 

GCC CAG CGC TGC GAG CAG ATT CGC GGT GTC ATG GOG GAC CCC AGC ACC TCC GAA TAC GAG 
AQRCEQIRGVMADPSTSEYE 
1084/361 1114/371 

AAG GAG AAG CTC CAG GAG CGT CTA GCT AAG CTC TCT GGC GGT GTT GCC GTC ATC AAG GTC 
KSKLQERLAKLSGGVAVIKV 

1144/381 1174/391 

GGT GGT GCC TCC GAG GTT GAG GTC GGT GAG AAG AAG GAC CGT GTT GTC GAT GCT CTC AAT 
GGASEVEVGEKKDRVVDALtt 
1204/401 " 1234/411 

gct acc cgt gct gct gtt gag gag got atc ctc ccc ggt ggt ggt acc ccc ctt ctc aag 
'atraaveegilpgggtallk 
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QCC GCC CCC AAC OGC CTT GAC AAT GTC AAG CCC GAG AAC TTC GAC CAG CAA CTC GOT CTC 
AAANGLDNVKPENFDQQLGV 
1324/441 1354/451 

AGC ATC ATC AAG AAT GCC ATC ACC CCC CCC GCT CGC ACC ATT GTT G\G AAC GCC GGC CTC 
SI IKNAI TRPART tVENAGL 
1384/461 1414/471 

GAG GGC AGC CTC ATT GTC GGC AAG CTC ACC GAC GAG TTC GCC AAG GAC TTC AAC CGC GGT 
EGSVIVGKLTDBFAKDFNRG 
1444/481 1474/491 

TTC GAC AGC TCC AAG GGC GAG TAC GTC GAC ATG ATC TCC AGC GGT ATC CTC GAT CCC CTC 
FDSSXGEYVDMISSGILDPL 
1504/501 1534/511 

AAG GTT GTT CGC ACC GCT CTG CTC GAC GCC AGC GGT GTC CCC TCC CTG CTC GOT ACC ACT 
KVVRTALLDASGVASLLGTT 
1564/521 1594/531 

GAG CTC GCT ATT GTT GAG GCC CCT GAG GAG AAG GGC CCC GCT GCT CCT GGC ATC GOT GGT 
EVAIVEAPEEKGPAAPGMGG 
1624/541 1654/551 

ATC GGT GGT ATG GGC GGC ATG GGC GGC ATG CAT GGA GAT ACA CCT ACA TTC CAT GAA TAT 
M GGMGGMGGMHGDT ftlkey 
1684/561 1714/571 

ATC TTA GAT TTC CAA CCA GAG ACA ACT GAT CTC TAC TGT TAT GAG CAA TTA AAT GAC AGC 
WLOLQPETTDLY CYEQLNDS 
1744/581 1774/591 

TCA GAG GAG GAG GAT GAA ATA GAT GGT CCA OCT GGA CAA GCA GAA CCG GAC AGA GCC CAT 
SEEEDEXDGFAGQAEFDRAH 

1804/601 1834/611 

TAC AAT ATT GTA ACC TTT TGT TCC AAG TGT GAC TCT ACG CTT CGG TTG TQC CTA CAA AGC 
YNIVTFCCKCDSTT.RLCVQS 
1864/621 1894/631 

ACA CAC GTA GAC ATT CCT ACT TTG GAA GAC CTG TTA ATG GGC ACA CTA GGA ATT CTG TGC 
THVDIRTLEDLLMGTLG IVC 
1924/641 

CCC ATC TCT TCT CAG AAA CCA TAG 

PICSQKP* «|gg 
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